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(54) Title: PROTEOME MINING 
(57) Abstract 

The present invention relates to a method and apparatus for 
screening diverse arrays of materials for bioactive compounds. 
In particular, techniques for rapidly characterizing compounds 
in arrays of materials in order to discover and/or optimize new 
materials with specific desired properties are provided. The 
figure represents one of the embodiments of the current invention 
method for isolating bioactive compounds from a complex 
mixture of proteins using an immobilized combinatorial library. 
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Proteome Mining 

US Government Rig hts 

This invention was made with United States Government support 
under Grant No. DK52378A, awarded by the National Institutes of Health. The 
United States Government has certain rights in the invention. 

Field of the Invention 

The present invention is directed to high throughput screening of 
proteomics for identifying bioactive compounds. More particularly the invention is 
Erected to the isolation of novel herbicides, antibiotics, antifungals, antivirus, 
insecticide or pharmaceuticals based on their interactions with target molecules. 

Background nf the Invention 

The animal, plant, prokaryotic and viral kingdoms contain within them 
a vast array of genes that express 100,000's of distinct proteins whose biological 
function is essential life. The number of genes contained with in a particular 
organism varies greatly. Generally, the simpler the organism the fewer the total 
number genes. For example, completion of the yeast genome shows that these 
organisms have about 8300 genes, the complete C. elegans genome contains about 
18,000 genes, and the human genome is estimated to contain about 100,000 distinct 
genes. Each gene encodes a specific protein which has a predetermined essential 
function for the over all survival of the organism. Collectively, given the biodiversity 
that exists on earth, the numbers of distinct genes that exist in nature is likely to 
25 number in the billions. 

Obviously not all of the proteins expressed by the genes of an organism 
are lucely to be of importance to man. Indeed the number of genes that are likely to 
express proteins of commercial or medical value is a tiny fraction of this vast 
biodiverse gene pool. Methods therefore that allow one to rapidly and effectively 
30 screen large numbers of proteins within this pool for valuable proteins are of great 
importance. In the case of the human genome it has been estimated that 
approximately 4000 of its genes are responsible for the causes of non-pathogen 
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induced human disease. In short this means that human tissue contain 4000 proteins 
of potential medieal and commercial value. The same analogies can be made to other 
species. For example if one was screening for a new antibacterial agent, one would be 
looking for bacterial protein targets that were peculiar to the particular bacterial strain 
> of interest. In the case of bacteria this has traditional been enzymes involved in the 
synthesis of the bacterial cell wall. The classic example of a drug that is selective for 
bacteria is penicillin which inhibits an essential enzyme required for synthesis of the 
bacterial cell wall. Humans do not possess any of the enzymes that make bacterial 
cell walls. One cannot simply target any bacterial protein when searching for new 
antibiotics, this is because even though there are many differences between humans 
and bacteria, a significant portion of the bacterial genome encodes proteins of similar 
structure or function as found in humans. Drugs that inhibited proteins with a 
common function in both organisms are unlikely to discriminate between the two 
species. 

To identify new drugs or commercially important bio-active molecules 
one needs methods that have the ability to encompass entire species genomes and 
immediately identify candidate proteins of importance. The present invention is 
directed to a method of identifying compounds that selectively interact with important 
biological components. This selective interaction is an essential element that makes a 
particular chemical have medical or commercial value. Without selectivity a 
compound has no bio-active value; selectivity is the single most important factor in all 
drug, antibiotic, antifungal, antiviral, insecticide and herbicide action. 

The selectivity of a valuable bio-active compound in 99% of all cases 
is based on its interactions with one or more specific proteins contained within the 
target cell or organism of interest. One or two percent of valuable bio-active 
molecules maybe directed towards non-protein targets such as DNA, RNA, lipids or 
sugars. Without exception a valuable bio-active chemical interacts with its protein 
target in a highly specific manner. The target protein will contain on its surface a 
domain or pocket that binds the chemical with high affinity. This domain or pocket is 
unique to the target and not the several 1000 proteins that may also be contained with 
in the cell expressing the target protein. 

In most cases the binding site for the chemical on the target protein is 
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important to the biological activity of the protein. The binding site may be required for 
enzymatic activity, or be site of hormone interaction (e.g. a receptor) or a binding site 
for an all osteric regulator. When a chemical binds to one of these sites and affects 
the biological activity of the target protein it invariably contains structural components 
that resemble the natural biological ligand that normally occupies the site. The 
affinity of the interaction of the chemical for the target protein generally increases as 
the overall structural components of the chemical mimic the natural ligand. In some 
cases bio-active molecules fit better into the natural ligand-binding site than the 
natural ligand itself. These types of molecules are likely to have extremely potent bio- 
activity. If these molecules possess structural features that prevent their metabohsm 
then this increases their bio-activity even greater. 

One of the primary mechanisms for identifying bio-active chemicals of 
medical or commercial value is to screen large combinatorial libraries with some form 
of an enzymatic or biological assay. Combinatorial libraries can be extremely diverse 
and contain many hundred thousands of distinct molecules of known or unknown 
structure. They can be derived from very diverse sources, including plant extracts, 
animal extracts, soil samples, bacteria, fungi, chemical industry byproducts etc. 
Theoretically, these libraries contain within them molecules of every conceivable 
shape and form. However, like the proteins to be targeted, only a small percentage of 
these libraries contain molecules that have important bio-activity. 

Prior high through put screens for drug discovery begin with a disease, 
the choice of which is invariably determined by potential market size. The etiology of 
the disease is first defined by basic research to determine likely underlying cause. 
This research identifies potential protein targets that may be useful drug targets; e.g. 
receptors or enzymes. The purified receptors or enzymes are then used to screen 
chemical libraries for agents either bind, inhibit or activate. Similar approaches are 
also used to screen for anticancer drugs. Transformed cell lines are used to screen 
large chemical libraries that may contain compounds that revert them to their normal 
phenotype or kill the cancer. Further investigation is then used to identify the 
molecular mechanisms by which active compounds from these screens bring about 
their cellular effect. 

Therefore in the traditional search for a new bio-active compounds one 
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begins with a specific biological problem in mind. For example a particular 
pharmaceutical house maybe focused on discovering new antihypertensive agents. Its 
decision to enter into the screening process is always based on the disease market size. 
Thus the search for drugs that would treat small populations of afflicted individuals is 
5 unlikely to happen in the private sector. In the case of a new antihypertensive agent 
for example, a drug screen will generally begin with an assay that includes a specific 
receptor or enzyme that has important factions in the regulation of blood pressure. 
One then has to hope that the libraries one screens contain bio-active molecules that 
selectively effect the proteins selected in the assay. Once candidate chemicals are 
1 0 identified one then has to demonstrate that these compound act selectively and 
predictably for the targeted protein in the assay and not others. Thus in the initial 
stages one ends up with many false positives which must be eliminated in a second 
round of screening because the entire expressed genome was not taken into account in 
the first instance. The invention described herein eliminates this problem at the start 
15 because it encompasses both the diversity contained within the chemical library to be 
screened, with the diversity of the expressed genome itself in one step. 

The selectivity is achieved in the analysis following sequencing of the 
targeted proteins. A decision as to whether a particular protein/chemical interaction is 
likely to have commercial and medical value is made during the last stages of 
20 analysis. Therefore, in addition to identifying bio-active agents that have commercial 
value the screen of the present invention does not exclude compounds that may have 
humanitarian value. This is because we could conceivably identify agents that bind to 
proteins important in the pathology of obscure diseases with small patient 
populations. 

25 Final, y> ^ P resent invention can readily cross platforms with no 

change in protocol or equipment. There is no difference in screening procedures for 
herbicides, antibiotics, antifungals, antivirus, insecticide or pharmaceuticals. All one 
changes is the expressed genome (proteome) that is to be screened. One can even use 
the same libraries for each screen; i.e. a library that did not yield any useful 

30 pharmaceutical agents may contain a useful herbicide. 
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Summarv of th<> Tnvpntinn 

The present invention is directed to compositions and methods for 
identifying bioactive compounds. Advantageously, the present method of identifying 
bioactive compounds utilizes both the diversity contained within a chemical library to 
be screened, with the diversity of the expressed genome itself in one step to maximize 
the efficacy of the screening procedure. The method comprises the steps of contacting 
an immobilized combinatorial library with the protein members of a proteome, 
characterizing the proteins that interact with members of the library to identify those 
proteins having important biological value, and isolating the corresponding compound 
from the library that interacts with a protein having important biological value. 

Brief Desc ription of the Drawinp s 

Fig. 1 is a diagramatic representation of the steps used in accordance 
with one embodiment to isolate bioactive compounds from a complex mixture of 
proteins (proteome) using an immobilized combinatorial library. 

Fig. 2 is a diagramatic representation of the steps used in accordance 
with one embodiment to isolate bioactive compounds from a complex mixture of 
proteins (proteome) using an immobilized combinatorial library. 

Fig. 3 is a diagramatic representation of the steps used in accordance 
with one embodiment to isolate bioactive compounds from a complex mixture of 
proteins (proteome) using an immobilized combinatorial library. 

Fig. 4 is a diagramatic representation of the steps used in accordance 
with one embodiment to identify cell surface receptors and their peptide ligands en 
masse from a predetermined cell type. 

Fig. 5 represents the stained SDSPAGE results from characterization 
of proteins isolated from rabbit skeletal muscle through the use o f gammaphosphate _ 
lin ked ATP-Sepharose, Rabbit skeletal muscle extract was prepare7from350gof 
tissue (w/w) and passed over 50 mis of gainmapho sphate linked ATP-Sepharose 
containing 10 umols/ml of linkedATP. Following washing, the column was eluted 
sequentially with NADH, AMP, ADP and ATP and fractions collected (lOmls). 
Column fractions were separated by SDSPAGE then transfered to PVM and stained 
with amido black. Proteins 1-17 were identified by mixed peptide sequencing (see 
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Table 1). 

Fig. 6 represents the stained SDSPAGE results from geldanamycin 
released muscle extract proteins. ATP-Sepharose was loaded with skeletal muscle 
extract and eluted successively with the indicated concentrations of geldanamycin, 
followed by 1 0 mM ATP. Peak fractions (20ul of 1 .0) were analyzed by SDSPAGE 
and silver staining or transferred to PVM for identification by peptide sequencing. 
Numbers indicate that proteins that wee identified on the PVM membrane: 1 . HSP90 
and proteolytic fragments of HSP90; 2. purine synthetase (ADE2); 3. myosin light 
chain kinase; 4. phosphorylase kinase; 5. p98 glucose indued kinase; 6. HSP70; 
arginine succinate synthetase; 7. glutamate dehydrogenase; 8. glutamate ammonium 
ligase; 9. glutathione sythetase; 1 0. aldehyde dehydrogenase; ll.MAPK; 
12.GAPDH; 13.PKA 
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Detailed Description of the Invention 

In describing and claiming the invention, the following terminology 
will be used in accordance with the definitions set forth below. 

As used herein, "nucleic acid," "DNA," and similar terms also include 
nucleic acid analogs, i.e. analogs having other than a phosphodiester backbone. For 
example, the so-called "peptide nucleic acids," which are known in the art and have 
peptide bonds instead of phosphodiester bonds in the backbone, are considered within 
the scope of the present invention. 

As used herein, bioactive compounds include any compound that is 
capable of inducing an effect on a living cell or organism. Bioactive compounds 
include but are not limited to pharmaceuticals, hormones, chemotherapeutics, nucleic 
25 acids and the like. 

As used herein the term "proteome" relates to a complex mixture of 
proteins that are derived from a common source, such as an extract isolated from a 
particular cell or tissue. For example a human proteome represents a mixture of 
proteins isolated from human cells. The category can be further defined by specifying 
30 a particular cell/tissue source for the proteome (i.e. a human myocardial tissue 
proteome represents all the proteins isolated from human myocardial tissue). 

As used herein the term -combinatorial library" relates to a collection 



WO 00/63694 



PCT/US00/09714 



-7- 



15 



of compounds. The combinatorial library can be a biological synthesized library that 
comprises nucleic acid sequences that include a common vector sequence (allowing 
for replication of the library in a host species) and a protein encoding region. The 
biological synthesized library can be further provided with regulatory elements that 
5 allow for the expression of the encoded proteins (i.e. an expression library). Chemical 
libraries are collections of compounds that were isolated from a natural source or were 
synthesized in a laboratory using chemical or biological processes. A "combinatorial 
chemical library" is a collection of compounds created by a combinatorial chemical 
process, wherein the compounds of the combinatorial chemical library have a 
1 0 common scaffold with one or more variable substituents. 

As used herein the term "solid support" relates to a solvent insoluble 
substrate that is capable of forming linkages (preferably covalent bonds) with soluble 
molecules. The support can be either biological in nature, such as, without limitation, 
a cell or bacteriophage particle, or syntheuc, such as, without limitation, an 
acrylamide derivative, agarose, cellulose, nylon, silica, or magnetized particles. 

As used herein the term "naturally-occurring" relates compounds 
normally found in nature. Although a chemical entity may be naturally occurring in 
general, it need not be made or derived from natural sources in any specific instance. 

As used herein the term "non naturally-occurring" relates to 
compounds rarely or never found in nature and/or made using organic synthetic 
methods. 

As used herein the term "functional analog" of a library 
compound/ligand relates to a compound that has a binding affinity for the same ligand 
as one of the members of the library, such that the functional analog will compete 
25 with the library component for binding to that ligand. 

The present invention is directed to a novel method for the rapid 
identification of bioactive compounds, including but not limited to novel drugs, 
antibiotics, antifungals, antiviral, insecticide or herbicides. The overall strategy 
30 behind the invention is to screen complex protein mixtures with an immobilized 

library of compounds for proteins that bind specifically to components in the library. 
The bound proteins are then identified by protein microsequencing to determine if the 
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identified protein is therapeutically relevant. The protein has therapeutic relevance if 
the protein is known to be central to the development of a disease, is a metabolic 
enzyme unique to a particular microorganism, yeast, virus, or fungi, or is an enzyme 
peculiar to a type of insect, or an enzyme required for photosynthesis in a particular 



weed. 



The advantage of the present screening methodology derives from the 
initial assumption that the entire genome is a potential drug target. The only decision 
prior to screening is to decide what proteome should be utilized; i.e. for drugs 
important in human disease, human tissue is the choice, for herbicide a plant species, 
1 0 for antibiotic, a bacterial strain. 

In one aspect of the invention, systems and methods are provided for 
rapidly screening a combinatorial library for bioactive agents. The method is based on 
the identification of those library components that interact with proteins of a 
preselected proteome, wherein the proteome protein is a potential target for 
therapeutics. The method of identifying bioactive compounds comprises the steps of 
contacting a combinatorial library with the protein members of a proteome under 
conditions that allow for specific interactions between proteins of the proteome and 
the bound library. Proteins that interact with the immobilized library components are 
then isolated and analyzed to determine if the protein is interesting from a therapeutic 
standpoint. Those proteins that have therapeutic relevance are then used to identify 
the component of the immobilized library that interacts with the protein. 

In one embodiment, the method of identifying bioactive compounds 
present in a compound library comprises the steps of contacting an immobilized 
compound library with the protein members of a preselected proteome, and washing 
the immobilized compound library with a buffered solution. In one embodiment the 
immobilized compound library comprises a column of particulate solid support, such 
as sepharose or agarose beads, that has the individual components of the compound 
library bound to the support, and the wash comprises the use of a low ionic or high 
ionic buffer. In one embodiment the column is washed with both a high ionic buffer 
and a low ionic buffer. After the solid support has been washed with buffer, the 
remaining bound proteins are released from the solid support by contacting the bound 
proteins with one or more individual members of the compound library or with 
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functional analogs of the library components. Alternatively, the bound proteins can be 
released from the immobilized library through the use of a chaotropic agent, including 
but not limited to detergents such as SDS, TritonX, sarkosyl, denaturants such as urea, 
or chelators such as EGTA or EDTA. The released proteins are then identified by 
5 protein sequencing or mass spectrometry; and the identity of the specific compounds 
of the compound library that bind to the released proteins is determined. 

The libraries 

Combinatorial libraries can be constructed using techniques known to 
1 0 the skilled practitioner to provides researchers vast number of chemical candidates to 
screen for potential bioactivity. In accordance with the present invention the library 
comprises a collection of compounds that are capable of specific binding to their 
target. For example, suitable library components include, but are not limited to 
peptides, proteins, carbohydrates, lipids, glycoproteins or nucleic acids. 
1 5 Biologically synthesized combinatorial libraries have been constructed 

using techniques of molecular biology. These library components are expressed using 
bacteria or bacteriophage particles. For example, U.S. Pat. No. 5,270,170 and 
5,338,665 to Schatz describe the construction of a recombinant plasmid encoding a 
fusion protein created through the use of random oligonucleotides inserted into a 
cloning site of the plasmid. In other biological systems, for example as described in 
Goedell et al., U.S. Pat. No. 5,223,408, nucleic acid vectors are used wherein a 
random oligonucleotide is fused to a portion of a gene encoding the transmembrane 
portion of an integral protein. Upon expression of the fusion protein it is embedded in 
the outer cell membrane with the random polypeptide portion of the protein facing 
25 outward. Thus, in this sort of combinatorial library the compound to be tested is 
linked to a solid support, i.e., the cell itself and the cell itself adheres to the cell 
culture substrate. The Goedell patent is incorporated herein by reference. 

Similarly, bacteriophage display libraries have been constructed 
through cloning random oligonucleotides within a portion of a gene encoding one or 
30 more of the phage coat proteins. Upon assembly of the phage particles, the random 
polypeptides also face outward for screening- Such phage expression libraries are 
described in, for example, Sawyer et al., 4 Protein Engineering 947-53 (1991); 
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Akamatsu et al., 151 J. Immunol. 4651-59 (1993), and Dower et ah, U.S. Pat. No. 
5,427,908. These patents and publications are incorporated herein by reference. 

While synthesis of combinatorial libraries in living cells has distinct 
advantages, including the linkageof the compound to be tested with its nucleic acid, 
5 there are clear disadvantages to using such systems as well. The diversity of a 

combinatorial library is limited by the number and nature of the building blocks used 
to construct it; thus modified or R-amino acids or atypical nucleotides may not be able 
to be used by living cells (or by bacteriophage or virus particles) to synthesize novel 
peptides and oligonucleotides. There is also a limiting selective process at play in 

1 0 such systems, since compounds having lethal or deleterious activities on the host cell 
or on bacteriophage infectivity or assembly processes will not be present or may be 
negatively selected for in the library. 

Another approach to creating molecularly diverse combinatorial 
libraries employs chemical synthetic methods to make use of atypical or non- 

1 5 biological building blocks in the assembly of the compounds to be tested. Thus, 

Zuckermann et al., 37 J. Med. Chem. 2678-85 (1994), describe the construction of a 
library using a variety of N-(substituted) glycines for the synthesis of peptide-like 
compounds termed "peptoids". The substitutions were chosen to provide a series of 
aromatic substitutions, a series of hydroxylated side substitutions, and a diverse set of 

20 substitutions including branched, amino, and heterocyclic structures. This publication 
is incorporated by reference herein. 

Alternatively, chemical synthetic methodologies can be used to create 
large diverse libraries of potentially usefiil compounds and permits the synthesis of 
compounds joined to a solid support of some kind or joined to an identifiable marker 

25 such as a flourescent tag. In accordance with one embodiment, the combinatorial 

library is chemically synthesized on solid supports in a methodical and predetermined 
fashion, so that the placement of each library member gives information concerning 
the synthetic structure of that compound. Examples of such methods are described, 
for example, in Geysen, U.S. Pat. No. 4,833,092, in which compounds are synthesized 

30 on fiinctionalized polyethylene pins designed to fit a 96 well microtiter dish so that the 
position of the pin gives the researcher information as to the compound's structure. 
Similarly Hudson et al., PCT Publication No. W094/05394, describe methods for the 
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construction of combinatorial libraries of biopolymers, such as polypeptides, 
oligonucleotides and oligosaccharides, on a spatially addressable solid phase plate 
coated with a functionalized polymer film. In this system the compounds are 
synthesized and screened directly on the plate. Knowledge of the position of a given 
5 compound on the plate yields information concerning the nature and order of building 
blocks comprising the compound. 

Another approach has been the use of large numbers of very small 
derivatized beads, which are divided into as many equal portions as there are different 
building blocks. In the first step of the synthesis, each of these portions is reacted 
10 with a different building block. The beads are then thoroughly mixed and again 
divided into the same number of equal portions. In the second step of the synthesis 
each portion, now theoretically containing equal amounts of each building block 
linked to a bead, is reacted with a different building block. The beads are again mixed 
and separated, and the process is repeated as desired to yield a large number of 
1 5 different compounds, with each bead containing only one type of compound. This 
methodology, termed the "one-bead one-compound" method, yields a mixture of 
beads with each bead potentially bearing a different compound. The compounds 
displayed in the surface of each bead can be tested for the ability to bind with a 
specific compound (i.e. a protein member of a proteome). 
20 The libraries used in the present invention can be well defined, 

containing known mixtures of molecules, or the library can be one in which the 
chemical content is poorly defined. 

In accordance with one embodiment the libraries are immobilized on a 
solid support. Biological material, including but not limited to proteins, 
25 carbohydrates, nucleic acids, lipids, glycoproteins can be bound to a solid surface 

using standard techniques known to those skilled in the art. In preferred embodiments 
the library compounds are linked through covalent bonds. The solid surface can be 
selected from any surface that has been used to immobilize biological compounds and 
includes but is not limited to polystyrene, agarose, silica or nitrocellulose. In one 
30 embodiment the solid surface comprises functionalized silica or agarose beads. 

In accordance with one embodiment the components of a sample are 
bound to silica or agarose beads in separate reactions using different reaction reagents 
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and conditions to ensure that a diverse array of compounds are bound to the solid 
surface. Fractions of these separate reactions can then be combined to form a single 
affinity chromatography column. For example, a portion of the library can be reacted 
with an inert solid support (e.g. agarose, Sepharose, polystyrene or other 
5 chromatography beads) using standard protocols for linking primary amines to NHS, 
cyanogen bromide activated or maleimide activated resins. Many of these resins are 
commercially available, for example, Pharmacia CH-activated Sepharose. Another 
portion of the library is then reacted to an inert support that would select any 
molecules containing carboxylic acid residues. A commercially available resin in this 
1 0 case would be Pharmacia EAH-activated Sepharose. A third and fourth portion of the 
library could be linked through thiol (SH2), phosphate or aldehyde (CHO) containing 
residues. The goal is to link as many components as possible with in the library and 
in as many orientations as possible. The orientation of molecule when it is tethered is 
critical to its ability to interact with potential target proteins. Thus for some 
1 5 molecules reaction through primary amines may cause binding of a target protein to be 
sterically hindered because of the thether. However, tethering of the same molecule at 
a carboxyl residue may not hinder interaction with a target protein. 

The entire library can be linked to the resin or portions of the library 
linked can be linked separately. One should aim to achieve as reasonably high a 
20 ligand concentration as possible per immobilized molecule. Ideally this should be 

between 10 nmol to 1 umoVml. In the case of libraries in which the chemical content 
is poorly defined two mixtures from the library are prepared, a mixture that is soluble 
in organic phase and a mixture that is soluble in aqueous phase. The same linking 
strategy is then employed for the preparation of the resins from the organic and 
aqueous soluble library members. Each resin is then placed into a chromatography 
column and equilibrated into the protein extraction buffer. 

In another embodiment (shown in Fig 2), individual components in a 
library, or mixtures containing between 1-10 chemicals, are attached to beads 
separately in microtitre plate wells. Several 100 beads can be reacted in each well at 
30 the same time. One bead is then selected from each well and placed in a 
chromatography column. Again each ligand should be attached in multiple 
orientations. In cases where a target compound is identified that interacts with a large 
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number of proteins (e.g. ATP, see Fig 3) only the target compound is immobilized on 
the solid support. The bound proteins can be released by the addition of an excess of 
free target compound, a derivative of the target compound or a functional analog of 
the target compound (see Example 2). 

The proteomes 

The immobilized libraries are contacted with a proteome under 
conditions conducive to the formation of specific interactions between components of 
the proteome and components of the immobilized library. Choice of the proteome 
used depends on the problem being addressed. If one is looking to isolate a new drug 
for the treatment of a human disease the human proteome (i.e. human tissue) is the 
best choice. If one wishes to discover a new antibiotic then the target pathogen (e.g. a 
gram negative bacterium) of interest is the obvious choice. For an insecticide, the 
targeted insect and so on. 

In accordance with one embodiment the proteome comprises a natural 
products library which represents a collecuon of natural products that have been 
recovered from biological material and have been determined to have biological 
activity. For example the natural products library may include a mixture of natural 
products wherein the mixture is known to induce a phenotypic change in a population 
20 of cultured cells. 

The amount of starting tissue used to isolated the proteome proteins is 
critical and should be based on theoretical recovery of target proteins. For example, if 
one is interested in identifying drugs that may bind to signal transduction molecules 
one should take into account the amount of these proteins (or copy number) that may 
be in the cell. Many of these types of molecules may be expressed as low as 200 
copies per cell. A quick calculation predicts that if one wanted to recover as much as 
10 pmol of a particular protein that was expressed at 200 copies per cell one would 
require 1 2 grams of wet weight tissue. For high copy number proteins, such as 
metabolic enzymes, less tissue mass would be required to achieve 10 pmol of protein. 
One should also factor in potential losses due to inefficiency of extraction or 
proteolysis. Thus although 12 g of tissue may contain a total of 10 pmol of a target 
protein of interest one may only recover a small percentage of this in the initial screen. 
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Fortunately, modem protein detection and sequencing methods enable one to identify 
proteins in the femto molar range. However, increasing the starting tissue mass as 
much as possible will improve the odds of recovering sufficient protein for later 
identification. 

5 Tissue from the chosen target proteome is ground and homogenized in 

buffers appropriate for solubilizing proteins and retaining their native conformations. 
This involves standard procedures utilized in most biochemical laboratories. 
Following clarification by centrifugation the portions of the extraction mixture are 
passed over the immobilized chemical library resins. 

1 0 When interactions are sought between library compounds and cell 

surface proteins, it may be advantageous to investigate such interactions with the 
surface proteins in their natural state (i.e. embedded in the cell membrane). In 
accordance with one embodiment, the "proteome" represents the set of surface 
molecules displayed by cells cultured on a cell culture substrate, and may include 

1 5 proteins, glycoproteins, carbohydrates and lipids. 

In accordance with one embodiment certain components of the 
proteome can be removed prior to contacting the immobilized compound library with 
the proteome. Components can be removed, for example, by fractionating the 
components based on molecular weight, electric charge and/or hydrophobicity. 

20 Alternatively, specific components can also be removed by ligand or antibody 

binding. Such methods allow the removal of protein components that do not warrant 
further investigation but are know to bind to certain components of the target 
compound library. In addition such prescreening allows for the removaJ or reduction 
of proteins that are expressed at high levels in the tissue used to generate the 
25 proteome. 

The compounds of the proteome are placed in contact with the library 
component under conditions favorable for specific interactions between members of 
the two groups. The interaction may result in the alteration of a physical characteristic 
such as fluorescence, absorption, enzymatic activity, but typically the specific 
30 interaction involves binding of the two components to one another. In accordance 
with one embodiment the library components will be immobilized on a solid support 
and the proteome components will be solubilized or suspended in a solvent. The 
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solvent will then be incubated with the immobilied library for a time sufficient for 
specific interaction between the library components and the proteome components. 

In accordance with one embodiment the library components are 
covalently linked to a particulate solid support and the particulate is used to form a 
column. In this embodiment the proteome solution/suspension is passed through the 
column to provide contact between the library components and the proteome 
components. Some of the proteome components will bind to the immobilized library 
components. The immobilized library is then washed with a buffered solution to 
remove any non-specifically bound proteins. In accordance with one embodiment the 
step of washing the library comprises the steps of washing with a high ionic strength 
buffer and a low ionic strength buffer to remove proteins that may be associated 
because of non-specific ionic or hydrophobic interactions.. 

Isolation and identification of target proteins with a defined immobilized library 

In accordance with one embodiment the library comprises a defined set 
of compounds that have been covalently linked to sepharose/agarose beads (resin) via 
amine, carboxylic acid, thiol, hydroxyl, aldehyde or phosphate linkages and the beads 
are combined to form a column (see Fig 1). The protein mixture (proteome) is then 
applied to the column followed by washes of high and low strength buffers. In 
20 accordance with one embodiment the solutions/suspensions are allowed to percolate 
through the column based on gravity. Alternatively, additional force can be applied to 
speed the flow of the eluate through the column; for example the column can be spun 
in a centrifuge to enhance flow through the column. 

After the column has been washed with the buffered solutions, the 
25 beads (resins) are either kept in the columns or removed and placed in equal amounts 
into 96 well microtitre plates (see step 5, Fig. 1). Mamtaining the resins in a column 
has the advantage of potentially recovering more protein per library ligand, however, 
it has the disadvantage of being much slower procedure overall. 

When using the microtitre plate approach, the number of plates and 
30 wells used depends upon the number of components that are in the defined chemical 
library. This could be a few 100 to several 1000 to 10,000's. Individual components 
from the library are added at high concentration (milimolar at least) to each well. 
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Application of resin and subsequent addition of the library component can be 
automated using commercially available robots. Following addition of the individual 
components to each well the plates are agitated for a pre-determined length and 
temperature (e.g. 30 minutes at 30°C). . 
5 In the case of resins packed in chromatography columns, each 

component in the library (free of the solid support) is passed separately over the 
immobilized library and any eluted material collected. It is anticipated that a certain 
portion of the library components will be able to compete with the bound ligand and 
selectively liberate one or more proteins from the resins in the plates or columns. A 
1 0 portion of each eluate is then placed into a microtitre plate for protein analysis. 

Advantageously, using this method identifies the specific library component that binds 
to the released protein. Further analysis of the protein will determine if this binding 
interaction has potential therapeutic value. 

If the column is broken down and distributed into the individual wells 
15 of a microtitre plate prior to releasing the bound proteome proteins, an additional step 
must be taken to isolate the released proteins. Library components are added to each 
well and incubated to release the bound proteins. Following incubation with each 
library component, the resin is allowed to settle or the resin suspension is centrifuged 
briefly (300 x g) to pellet the beeds. In accordance with one embodiment the resin 
20 beads can be magnetized beads, and a magnetic field is applied to the bottom of the 
plate to hold the resin at the bottom. After the resin has been separated from the 
supernatant, a portion of the supernatant from each well is removed and placed in a 
second well containing a high sensitivity protein staining reagent. This last step can 
be automated using standard robots familiar to those skilled in the art. 
25 The protein detection reagent used to detect the presence of released 

protein can be any of those known to the skilled practitioner. In one preferred 
embodiment the detection reagent is one that changes color in the presence of protein. 
Radioactive isotopes that bind proteins (I 125 ), fluorophors (e.g. FITC) or gold stains 
may also be used to increase sensitivity. Specialized detection systems capable of 
30 detecting these markers are known to those skilled in the art. 

Wells/column fractions that are positive for protein are selected for 
further analysis using standard techniques. It is anticipated that the number of 
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positive wells would be a few percent of the total number of components that are in 
the library screened - although this may be several 100 if the total number of 
components in the initial library numbers in the 10,000's. The proteins will be 
subjected to gel electrophoresis analysis, typically using SDSPAGE, to measure the 
5 purity of the protein and quantitate the amount of recovered protein. The supernatant 
from each well that contains liberated protein(s) is mixed with SDSPAGE sample 
buffer and characterized by SDS gel electrophoresis. Many hundreds of supematants 
can be easily characterized using this method. The gel is stained with silver, 
Commassie or colloidal gold (for sequencing by mass spectrometry) or transferred to 
1 0 polyvinyl membrane (for mixed peptide sequencing). At this point the molecular 
weights and amount of protein recovered are determined. 

For mixed peptide sequencing, the proteins on the polyvinyl membrane 
(PVM) are stained then excised (See Darner et al., (1998) J. Biol. Chem 273: 24396- 
24405). The membrane pieces are digested briefly with CnBr, washed and placed 
1 5 directly into an automated Edman sequenator. Mass spectrometry can also be used 
but may be less desirable because of the amount of labor that is required and its 
inability to handle many protein samples at one time. In the case of mixed peptide 
sequencing between 6 and 12 rounds of Edman sequencing are carried out and the 
mixed peptide sequences generated sorted and matched against the databases with the 
20 FASTF (protein databases) and TFASTF algorithms (DNA data bases). This process 
identifies the liberated proteins in each well. 

At this point a determination is made as to whether or not the liberated 
protein is interesting (i.e. is the protein involved in a human disease, is it an important 
enzyme to bacterial metabolism., etc). If the protein is determined to have therapeutic 
25 value, then the chemical that liberated the protein from the immobilized library is 
chosen for further characterization using conventional approaches. For example, an 
affinity assay will typically be conducted to determine if the protein has sufficient 
affinity for the target library compound (i.e. binding at nanomolar concentrations) to 
be useful as therapeutic agent. In addition, analysis will be conducted to determine 
30 what is the biological impact of the interaction and whether the affinity of interaction 
with the targeted protein can be improved by modification. 

This screen may yield several candidate proteins that are valuable in 
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the initial round and not confuie one to a single field or market or interest. 
Importantly, a single round of screening will not only identify a potentially useful 
bioactive agent, but will also provide valuable information about its targets, what 
groups can be modified without affecting function and where the agents should be 
5 applied. 

In another embodiment of the present invention, library components 
are fractionated and linked to a solid support in separate vessels (see Fig. 2). The 
various immobilized fractions of the library are then combined and packed into a 
column. The proteome of interest is then place in contact with the library under 

0 conditions suitable to allow for specific interaction between the proteome and library 
components. The resin is then washed to remove non-specifically bound proteins, 
typically using both a high ionic and a low ionic strength buffer. The bound proteins 
are then labeled with a detectable maker, for example either iodinating with I 125 , or 
reacting with fluorescent marker or dye (e.g. iodofluorescein). The excess probe is 

5 washed away and the beads removed and individual beads placed into 96 well 
microtitre plates. The plates are then scanned for protein either by detecting 
radioactivity, fluorescence or color. 

Beads that are positive for protein are treated with SDSPAGE sample 
buffer and their protein content determined by gel electrophoresis as described 

) previously. If a protein is deemed to be of value, the bead that contained the 

identified protein is treated to liberate the ligand (library component) for chemical 
identification. 

In some instances a library component may be identified that binds to 
many protein targets. As outlined in Fig. 3 a ligand that interacts with many protein 

> targets can also be used to identify important drugs. In accordance with this 

embodiment a proteome is passed over a resin containing a single ligand (e.g. gamma- 
phosphate linked ATP Sepharose). Following washing to remove non-specifically 
bound proteins, either the bound proteins are labeled as described in Fig. 2 or the 
column is successively washed with components in a chemical library and fractions 

» collected as described in Fig. 1 . If the proteins are labeled then the beads are removed 
from the column and placed into microtitre plate wells (1-10 beads/well to give -20 
nmols total protein). Individual components in the library are then applied to each 
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well and the supernatants sampled for protein release. Preferably, each library 
component should be added at increasing doses in orders of magnitude ranging from 1 
nM to 1 mM. Proteins that are selectively liberated in the nm^M range are analyzed 
by SDSPAGE and mixed peptide sequencing is conducted as described above. 
5 In accordance with one embodiment the method of identifying 

bioactive compounds from a complex mixture of proteins comprises contacting a 
combinatorial library with the complex mixture of proteins under conditions that 
allow for specific binding of the proteins to library components. Preferably the library 
components are immobilized on a solid support via a covalent linkage, and numerous 
1 0 compounds of the library are present in multiple copies that are bound to the solid 
support in multiple orientations. The immobilized library is then washed with a 
buffered solution, preferably with a high ionic strength buffer and a low ionic strength 
buffer, to remove non-specifically bound proteins. In accordance with one 
embodiment the solid support is in particulate form, and the method further comprises 
1 5 the step of distributing equal portions of the support particles into a plurality of wells 
of a microtitre plate after the step of washing the immobilized compound library. 

The proteins bound to library components by specific interactions are 
then released, preferably by a competition reaction using one or more of the 
components of the library (in an unbound state). Wherein the library has been 
20 fractionated and equal portions of the support particles have been distributed into a 
plurality of wells of a microtitre plate the step of releasing the bound proteins 
comprises adding to each microtitre plate well one or more compounds of the library. 
The released proteins are characterized using standard techniques and the compounds 
of the library that specifically bind to the released proteins are identified. 
25 The release proteins will be identified primarily based on 

microsequence analysis and comparison to existing protein databases. This has been 
made feasible because of the near completion of the nucleotide sequencing of several 
genomes, including human, mammalian, C.elegans, bacteria, yeast, viral, rice, corn. 
The invention will have increasing relevance as more species specific genomes 
30 become complete. The proteins remaining bound to the immobilized library after the 
washing steps can also be labeled to assist the detection of the proteins. 
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Isolation and identification of target proteins with an undefined 
immobilizedchemical library. 

All of the initial steps are the same as described above. Following 
preparation of the resins and application of protein mixture, the resins are again 
aliquoted into titre plate wells. However, since the components in the library are 
poorly defined and of unknown number some fractionation of the library using 
standard methods is required. Useful steps for fractionation include organic and 
aqueous extraction, HPLC or ionic-exchange fractionation. The fractionated library is 
then applied to each well containing resin and incubated as described. In one 
embodiment these steps are carried out robotically. The column chromatography 
approach out-lined above is also applicable with an undefined library. 

Following incubation with each fraction of the library the resin is 
pelleted as described above and a sample of the supernatant taken for protein analysis. 
Fractions that contain liberated protein are selected for characterization by SDSPAGE. 
15 At this point it is likely that some fractions of the library will liberate many proteins, 
some only a few. In either case, mixed peptide sequencing or mass spectrometry can 
be used to identify all these components in a short space of time. With mixed peptide 
sequencing a standard Edman sequencer containing 4 reaction chambers can identify 
20-30 proteins per week. Mass spectrometry will be somewhat slower if a species- 
20 specific database is not available. The list of proteins that are identified for each well 
is then surveyed for the criteria stated earlier. 

Proteins that are deemed valuable are expressed as recombinant 
proteins and immobilized on a second resin (Sepharose or agarose beads). The 
fraction of interest or entire chemical library is then passed over the protein affinity 
25 column to selectively recover the chemical compounds with in the library that bind the 
protein of interest. Mass spectrometry or NMR techniques can then be used to 
identify or determine the structure of the bioactive compound. One then proceeds 
with the standard methods to characterize the bioactivity of the isolated chemical. All 
three strategies as outlined in Figs. 1-3 can be applied to an undefined library. 
30 111 accordance with an alternative embodiment bioactive compounds 

are isolated through the use of intact cells. This method is particularly useful for 
identifying bioactive agent that interact with cell surface molecules such as receptors. 
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The cells are grown on a cell culture substrate suitable for the type of cell grown. A 
complex mixture of labeled proteins or peptides is then added to the cells under 
conditions that allow for specific binding of the labeled proteins to the cell surface 
proteins. In one embodiment the cells are culture in multiple plates and the complex 
5 mixture of labeled proteins/peptides is divide equally among the multiple plate of 
cells. The cells are then washed under conditions that do not dislodge the cells from 
the substrate. In accordance with one embodiment the cells can be fixed to the cell 
culture substrate prior to incubation with the labeled proteins. 

After the plates have been washed to remove non-specifically bound 
1 0 proteins, the plates are screened for the presence of labeled proteins. The labeled 
proteins/peptides are released using the same procedures as described above and 
analyzed by gel electrophoresis and microsequencing. 

In accordance with one embodiment, the complex mixture of labeled 
proteins comprises randomly generated peptide sequences that have been labeled with 

1 5 a fluorescent entity. The binding of such labeled proteins to the cell surface molecules 
effectively concentrates the label at the bottom of the culture plate and thus a positive 
reaction can be detected even in the absence of washing the cells to remove unbound 
labeled protein. For example, an excitation light source can be provided wherein the 
beam of light is parallel to the cell surface and the detector is similarly position so that 

20 only signal generated from the cell surface will be detected. The bound protein can 
then be released using any of the techniques described previously herein, and the 
protein can be analyzed as describe above. 

Fig. 4 exemplifies one approach used in accordance with the present 
invention for identifying cell surface receptors and their peptide ligands en masse. 

25 The overall scheme outlined in Fig. 4 is a variation of that disclosed in Fig. 2. 
Bioactive peptides are of pharmaceutical value because they mimic naturally 
occurring proteins or larger peptides that bind to important cell surface receptors e.g. 
interferon's, cytokines, growth factors, endorphins. Bioactive peptides can be 
generated randomly in large libraries using combinatorial approaches. Peptides in 

30 these libraries are general range from 4 to 20 amino acids in length. These libraries 
can be generated synthetically or recombinantly as fusion proteins. In the case of 
fusion proteins, random peptide sequences are displayed at the N or C termini of a 
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known recombinant protein expressed in yeast or bacteria (Blum et aJ. 2000 PNAS91, 
2241; Geyer et al. 1999 PNAS 96: 8567). The fusion protein displaying the peptide 
are recovered by affinity chromatography through an affinity tag that is present at the 
opposite end of the molecule (C or N termini) from the peptide. 
5 There are 24 naturally occurring amino acids in nature which can be 

used to construct a random combinatorial library. A peptide library consisting of 
peptides of 20 amino acids in length can therefore have 20 24 possible combinations. 
This gives an extremely large number of possible variations of peptide and 
theoretically covers all possible peptide sequences that could occur in nature. 

1 0 Typically these types of peptide libraries can be used to search for cell surface 
receptors or protein partners that would selectively bind one or more peptide 
sequences contained within them. As shown in Fig. 4, a recombinant peptide library 
cultured in bacteria or yeast, or a synthetic combinatorial peptide library tagged with a 
fluor (e.g. fluorescein), is mixed (at InM - luM concentration) with a designated 

1 5 target cell line (e.g. cancer cell line, B or T cell) that is present in the wells of a multi 
chamber titer plate. The plate is placed in an instrument capable of detecting 
fluorescent labeled probes on cell surfaces at 100 -5000 molecules per cell. In our 
laboratory we use the PE-Biosystems FMAT robot (Swartzman et al. 1 999 Anal. 
Biochem. 271 : 143). In the case of synthetic peptides the cells are screened for 

20 specific binding of fluor tagged peptides on the cell surface. In the case of 

recombinant fusion proteins displaying the random peptides a fluor tagged antibody 
that recognizes the fusion protein is added. 

The peptides that produce positive results are sequenced. In the case of 
the synthetic peptides this can be done directly without further purification in an 
25 Edman sequencer or mass spectrometer. In the case of the bacterially or yeast 

expressed fusion protein two methods can be used to sequence the peptide. Positive 
clones can be cultured and the expression vector encoding the fusion protein can be 
sequenced across the region encoding the random peptide by DNA sequencing. Or 
alternatively, the fusion protein can be isolated from a culture of bacteria or yeast and 
the random peptide sequence determined by mass spectrometry or Edman sequencing. 
Once the peptide sequence is identified it is produced synthetically and tagged with a 
fluorophor. The affinity of the peptide for the cell surface receptor is determined. If 
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the affinity is determined to be sub uM, an affinity column is constructed from the 
peptide for purification of the receptor target. Typically this would involve linking the 
peptide via its C or N terminus to a C8 spacer attached to a Sepharose bead. Cell 
extract prepared from the designated cell target would then be passed over the resin to 
recover the receptor target for identification by protein sequencing. Bound proteins 
would be recovered by either eluting the resin with free peptide or washing with SDS. 

SDSPAGE and protein staining would be used to quantitate and 
evaluate purity. Mixed peptide sequencing or mass spectrometry would be used to 
identify the protein. If the protein is found to be an important cell surface receptor of 
commercial or medical value the peptide and protein target would be fully 
characterized. This screen is anticipated to identify many cell surface receptors and 
tlieir bioactive peptide ligands. Some receptors will be well characterized, many 
others are anticipated to be novel. Significantly, in addition to identifying new 
peptide ligands and their physiological targets, our assay, like the other methods, also 
1 5 gives a measure of selectivity and potential toxicity. This is because the identified 
bioactive ligands, in addition to their true physiological target, had an equal 
opportunity to interact with all other cell surface receptors that happen to be expressed 
on the designated cell target. 

20 Example 1 

Isolation of Adenine Nucleotide Binding Protein from a Proteome 

As a proof of principle and to evaluate the types of proteins that bind to 
gamma-phosphate linked ATP-Sepharose, tissue extracts prepared from rabbit skeletal 
muscle, liver, kidney, brain or bladder were passed over a gamma-phosphate linked 

25 ATP-Sepharose affinity resin. Following extensive washing to remove non- 

specifically associated proteins, the resin was washed sequential with NADH, AMP, 
ADP and ATP. Fig. 5 shows the results from characterization of proteins isolated 
from skeletal muscle. Similar results were obtained with other tissues, although the 
pattern, abundance and complexity between tissues varied considerably due to varied 

3 0 levels of expression of individual proteins (See Fig. 5 and Table 1 ). 

Gamma phosphate linked ATP -Sepharose was washed with extracts 
prepared from rabbit, skeletal muscle, kidney, liver, brain or bladder. Following 
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washing the resin was eluted successively with the indicated nucleotides as described 
in Fig. 5. 

Proteins 1-17 (see Fig. 5) were identified by mixed peptide sequencing 
(Table 1). Eluted proteins were characterized by SDSPAGE, transferred to PVM and 
treated with CnBr or Skatol prior to mixed peptide sequencing. Mixed peptide 
sequencing was carried out on average for 6-12 Edman cycles. The mixed sequences 
were sorted and matched against the entire published protein or DNA data bases with 
the FASTF or TFASF algorithms respectively (Darner et al. 1998., Alms et al. 1999). 
Expectation scores for the identified proteins ranged from 2.6 e 7 for PKA to 1.2 e M 
for GAPDH. Expectation scores after each search for the next highest scoring non- 
related protein were generally < 2.3 e * The experiment shown was repeated on 
several occasions, and on several different tissue including liver (120g) , kidney (60g), 
brain (60g) and bladder (20g) with similar results (Table 1). 
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Table 1 


NADH 


pmol 


AMP 


nmnl 


GAPDH (M) 1* 


25,000.0 


Purine synthetase (M,L) 2* 


5 0 


malate dehydrogenase (M) 8* 


0.5 


Phosphorylase (M) 3* 


10.0 


glutarnate dehydrogenase (L) 9* 


1.5 


A\4T^ ^PtivJitpH r»rr\t<=»in 

kinase (L) # 


z.U 


aldehyde dehydrogenase (M) 7* 


2 0 




4.0 


lactate dehydrogenase (M,L) # 


50.0 


ESTAA571903 (L) 


5.0 


dehydrogenase CM T.V 




phosphatidylinositol-4- 
phosphate 5 -kinase (L) 


2.0 


isocitrate dehvdropena<;p (\ \ 


1 a 
i .u 


protein kinase DUN] 
(related) (L) 


1.0 


3 -hydroxy acyl-CoA 
dehydrogenase (L) 


1.0 






sorbitol dehydrogenase (L) 


2.0 






alcohol dehydrogenase (M,L) 


20.0 






glucose-6-phosphate 
dehydrogenase (M,L) 


17.0 
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Table 1 (cont.) 


ADP 


pmol 


ATP 
Air 


pmol 


Heat shock protein 
90(M/L)4*# 


15 000 0 


kVLrvr fv ^1V1 j J ft 


5.0 


Purine synthetase 
(M/L) 2* 


! 14.0 


MEK1 (MM 


6.0 






Pyridoxal kinase (M ] ^ 1?* 


inn 






Arginnosuccinate synthetase fN/H 1 1 * 


t n 






Glutamate ammonium lipase CKA T \ 


z.u 






Adenosine kinase (M,L) 15* 


5.0 






CSK fivn 1 fs*# 


4.0 






HSPA5 (M"i 17* 


4.0 






P90 S6 kinase fN/f l ^ 14* 


0.5 








0.2 






1 JrllUl/Acll lVlIld.bC 11-*) 


5.0 






R9J\ Pllir-n^p inHnrp/^ Unooo 
1 ^ ° giuwuoc iiiuuccu ivlJlaSC 

(M,L,SM,B,K) 10* 

\ 7 7 7 7 J 


10.0 






Heat shock protein 70 (L,M,B,SM,K) 


55 0 






PKA (L,M,B,SM) 


6 0 






Glutamine synthetase (L) 


9.0 






ZIP Kinase (SM)# 


0.2 






Phosphofructokinase (L) 


6.0 






Heat shock protein 60 (M,L,B > SIVl,K) 


10.0 






RNA helicase (L) 


1.0 






Protein kinase PC- 1 (L) 


0.6 






Protein kinase C epislon (L)# 


0.2 






Beta tubulin (B)# 


50 
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P6 electron transport flavoprotein a 
subunit (L) 


1.0 






serine/theonine-protein kinase ip] 1 
(related) (L) 


0.2 






protein kinase C beta-II (L) 


0.2 






protein kinase kern (L) 


0.3 






cyclic G kinase (SM)# 


5.0 






lupus nephritis protein LN1 (SM) 


1.0 






casein kinase 1 (M) 


5.0 






casein kinase 1 1 (M) 


6.0 






GSKin (M) 


4.0 






lim domain kinase 1 (SM) 


0.2 






protein kinase pkxl (SM) 


0.3 






pp60c-src (M) 


1.0 






MLCK (M,SM)# 


6.0 






Phosphorylase kinase (M) 


20.0 






Arginine deimidase (M) 18* 


5.0 






CAM kinase II (B,M)# 


1.0 






fructose- 1 -6-bisphosphatase (M,L) 


2.0 



The pmol amount of protein recovered was determined from PTH amino acid 
recovery during mixed sequencing multiplied by the volume applied to the gel and 
fraction volume (mis), indicates proteins identified in Fig. 5; # indicates proteins 
tested for binding efficiency by Western analysis of the column flow through. 



To unambiguously identify the eluted proteins in each case, peak 
column fractions were transferred to PVM following SDSPAGE and subjected to 
25 mixed peptide sequencing (Table 1). Table 1 shows the identification of over 70 
proteins that were eluted from ATP Sepharose loaded with skeletal muscle, liver, 
brain, kidney or bladder. Without exception, all of the proteins identified in the 
protein data bases belonged either to the protein kinase, dehydrogenase or ATP-grasp 
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classes of purine binding proteins. Analysis of PTH amino acid recovery during 
mixed peptide sequencing reveals that the affinity resin recovered proteins of both 
high and low cell copy number (Table 1). Western blotting of the column flow 
through with antibodies to several of the identified proteins demonstrated that the 
5 resin absorbed the tested proteins with >85 % efficiency from the initial extract (Table 
1). This finding indicated that the differences in recovery of individual proteins in the 
nucleotide washes was a reflection of cell copy number rather than because of affinity 
differences for the immobilized ATP. 

Several of the identified proteins have been crystallized with NADH, 
1 0 AMP, ADP or ATP bound and these published structures explain selective recovery 
of each classification of protein from the affinity resin. Inspection of the three 
dimensional structure of 10 of the dehydrogenases identified in the NAD wash shows 
in each case the adenine portion of these nucleotides is buried within a cleft 
containing the conserved Gly-X-Gly-X-X-Gly loop. The diphosphate portion of the 
1 5 bound nucleotides spans an open region on the surface connecting to the nicotinamide 
moiety accommodated within a closely situated second binding site. 

The finding that 0.5 mM NADH/NAD exclusively eluted 
dehydrogenases over other purine binding proteins is consistent with the well 
established preferences of these types of enzymes for nicotinamide containing purines. 
20 Although it should be noted that in separate experiments increasing the concentration 
of NADH/NAD to >10 mM did begin to elute many of the proteins found in the 
subsequent AMP wash. Characterization by microsequencing of the AMP eluate 
identified two proteins that are allosterically regulated by the nucleotide, glycogen 
phosphorylase and the AMP activated protein kinase. In the case of phosphorylase, 
25 elution with AMP is consistent with the crystal structure of enzyme in its T' state, 
which is favored by the presence of glucose, ADP and ATP competitors, low 
concentrations of substrate (Sprang et al. 1991). Purification of the AMP activated 
protein kinase over gamma phosphate linked ATP-Sepharose has been reported 
previously (Davis et al 1996). The enzyme is known to contain both an ATP and 
30 AMP binding pocket and is activated by AMP in vitro. Recovery of the kinase with 
AMP therefore is most likely because of interaction with the immobilized ATP with 
its AMP binding pocket. Elution of multifunctional protein ADE2 with AMP (and 
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also ADP) is consistent with involvement of this protein in catalyzing the conversion 
of AIR to CAIR (steps 6 and 7) in purine biosynthesis. Although the three 
dimensional structure of mammalian ADE2 has not been solved, the related E.coli 
enzyme, N5-carboxyaminoimidazole ribonucleotide synthetase (PurK) with ADP 
5 bound was recently reported (Thoden et al. 1 999). PurK belongs to the ATP grasp 
superfamily of purine binding proteins, and in prokaryotes, plants and yeast, catalyzes 
the conversion of AIR to CAIR in a two step process (steps 6 and 7 of 10) involving a 
second distinct gene product PurE. Recovery of mammalian ADE2 from the affinity 
resin in this present study suggests it also binds purine nucleotides in a similar 
1 0 orientation to that found in PurK. Phosphatidylinositol-4-phosphate 5-kinase, protein 
kinase DUN1 (related) and the two EST AA254816, ESTAA571903 all contain 
nucleotide binding motifs in their primary sequence and elution of these proteins with 
AMP suggests that they also bind the purine orientating the phosphate such that it is 
solvent accessible. 

1 5 Elution of the resin with ADP following AMP eluted two proteins, 

HSP90 and ADE2 in all tissues tested. The recovery of additional ADE2 with ADP 
suggests that this enzyme may either have two separate adenine binding pockets or 
exist in two conformational states that discriminate the presence of _ and _ phosphates 
on the two types of purine. Recovery of HSP90 with ADP is consistent with recent 

20 reports by Toft and co-workers identifying the N terminal domain of HSP 90 as a Mg 
2+ ATP/ADP binding domain and the crystal structure of this domain with ADP or 
ATP bound (Prodromou et al. 1997, Stebbins et al. 1997 35-38). Interestingly, the 
purine binding pocket on HSP90 was not readily predicted to exist based on primary 
.sequence alignments alone. Recovery of HSP90 suggests that other non-conventional 

25 purine binding proteins presenting adenine containing nucleotides in the "protein 
kinase" orientation are likely to be recovered from the affinity resin. Examples of 
other proteins that been crystallized with purine bound and classified as having non- 
conventional binding domains are the adenine binding domain of DNA gyrase B 
(Wigleyet al. 1991), the AMP binding sites on glycogen phosphoryjase and adenylate 

30 kinase, the ADP binding sites on fructose- 1 -6-bisphosphatase and 

phosphofructokinase, the cyclic AMP binding sites on catabolite activator protein and 
ATP binding site in DD-ligase. Notably four of these proteins were subsequently 
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recovered in the ATP wash (Table 1). 

Final elution of the affinity resin with ATP eluted a diverse range of 
proteins in all tissues tested, from heat shock proteins and metabolic enzymes with 
non-conventional nucleotide binding folds to a variety of protein kinase family 
5 members (Table 1). The majority of the proteins recovered are known to utilize Mg 
2+ ATP and show a high degree of specificity towards the nucleotide. Consistent 
with this observation, we have found that inclusion of mM NADH, AMP and ADP in 
the extraction buffer completely abolishes binding of all of the proteins shown in 
Table 1 that would normally be recovered from the resin in their absence (data not 

10 shown). In contrast, proteins identified in the AIT elution are retained on the resin 
under these conditions. Amongst the most frequent of all the adenine binding proteins 
sequenced in Table 1 are protein kinases. It has been estimated that up to 2% of the 
entire human genome may encode a protein kinase with a highly conserved ATP 
binding cassette. When the amino acid sequences of over 400 protein kinases are 

1 5 aligned with that of cyclic AMP dependent protein kinase, 1 5 amino acids residues 
within 1 1 conserved subdomains are nearly invariant In addition, there are 19 
hydrophobic amino acids of similar structure that are also conserved within the 
protein kinase family. In the activated state, the conserved and invariant amino acids 
of the ATP binding cassette make intimate contact with MgATP and orientate the 

20 molecule such that its gamma phosphate is exposed at the lips of the catalytic cleft. 
The recovery of several distinct tyrosine and serine/threonine protein kinases as 
reported herein, and reports by others utilizing gamma phosphate linked ATP- 
Sepharose in purification schemes for specific kinases, demonstrates that this affinity 
resin is likely to bind all protein kinase family members. Furthermore, this finding, 

25 combined with the frequency of occurrence of protein kinases, dehydrogenases and 
some of the other proteins identified in Table 1 in the current protein and DNA data 
bases suggests that the ATP resin may catch 3-5% of all proteins present in most 
eukaryotic genomes. 



! 
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Examplc 2 

Screening for Selective Inhibitors for Purine Binding Proteins 

To test the concept of proteorae mining of a combinitorial and natural 

products small molecule libraries for selective inhibitors of purine binding proteins, 

5 geldanamycin (GA) and 74 structural analogs were passed over g amma-phosphate 

Jinked AT P Sepharose that had previously been loaded with whole skeletal muscle 
v rr~— — m 

extract. To ensure that all proteins that bind adenosine in the "protein kinase 
orientation" the resin contained an ATP concentration between 10-15 tiMol/ml. 
Initial ligand screens were performed at 10 jaM which would enable only 

1 0 pharmacologically relevant competitive inhibitors to be identified in the small library. 
This is because any protein that was selectively eluted from the ATP resin, by a 
particular GA analog, in order to have pharmacological relevance in subsequent cell 
based assays would have to be able to compete with a physiological ATP 
concentration of -10 mM. As discussed earlier, the high ligand concentration also 

1 5 ensured that proteins of both high and low affinity, and copy number would be equally 
and maximally recovered by the resin. Fig. 6 shows a silver stain of peak column 
fractions after eluting the affinity resin with increasing concentrations of GA. A 
similar gel was transferred to PVM and the most abundant proteins identified by 
mixed peptide sequencing. 

20 Washing the affinity column with 10 nM GA was found to almost 

exclusively elute a single protein at 45kDa. The fraction also contained a minor 
amount of a 90 kDa protein. Mixed peptide sequencing identified the 45kDa protein 
as ADE2 and the 90kDa protein as HSP90. In particular, mixed peptide sequencing 
identified peptide sequences Met Phe Phe Lys Asp Asp Ala Asn Asn Asp Pro Gin Trp 

25 (SEQ ID NO: 1 ) and Met Lys lie Glu Phe Gly Val Asp Val Thr Thr Lys Glu (SEQ ID 
NO: 2) which were 100% identical to peptide sequences of purine multifunctional 
enzyme (ADE2). Mixed peptide sequencing also identified peptide sequences Met 
Thr Lys Ala Asp Leu He Asn Asn (SEQ ID NO: 3) and Met He Gly Gin Phe Gly Val 
Gly Phe Tyr (SEQ ED NO: 4) which were 100% identical to peptide sequences of heat 

30 shock protein 90 beta (HSP90). 

These findings identify ADE2 as a new target for GA in vitro. PTH 
analysis of the sequenced proteins indicated that there were 2 pmols of ADE2 in the 
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gel compared with <150 fmol of HSP90. Increasing the concentration of GA to 100 
nM eluted 2 proteins of 70kDa and 65 kDa respectively that were identified by mixed 
peptide sequencing as proteolytic fragments of HSP90. No other proteins were eluted 
from the resin until the concentration of GA reached 10 nM Approximately lpmol of 
5 HSP90 was recovered in the gel at this step. Increasing the concentration of GA to 
100 eluted a large amount of HSP90 and several N terminal proteolytic fragments 
of the protein. Approximately 1 1 pmols of the proteins was sequenced from the gel. 

At ImM GA 3 pmols of HSP90 was sequenced from the gel. 
Significantly no other proteins were eluted at this concentration. Subsequent, elution 
10 of the column with 10 mM ATP liberated a complex mixture of proteins of varying 
abundance and molecular weights. Consistent with previous results, mixed peptide 
sequencing of a selection of these proteins identified them as adenine nucleotide 
binding proteins. 

The finding that the GA concentration could be raised by 3 orders 

1 5 magnitude over the concentration required to elute ADE2 before significant HSP90 
was recovered demonstrates that in solution GA prefers the former enzyme over the 
later. Although, increasing the concentration to 1 mM did not elute any other proteins 
and is a testament to the selective of GA towards HSP90 and ADE2, these findings 
have implications for the actions of GA in vivo. In particular, the elution of ADE2 by 

20 GA illustrates a potential unforeseen toxicity of GA. The enzyme ADE2 catalyzes 
two essential steps that are required for the synthesis of purine nucleotides. Inhibition 
of ADE2 activity would therefore be toxic to all cell types. 

To discriminate functional regions on GA that may discriminate sites 
of interaction with HSP90 from ADE2, 74 structural analogs (Table 2) of the 

25 molecule were passed over gamma-phosphate linked ATP Sepharose that had been 
charged with skeletal muscle extract. 

Each analog of GA was washed over the gamma-phosphate linked 
ATP Sepharose that had been charged with skeletal muscle extract at 1 0 ^iM and the 
elutes analyzed by fluorography and SDSPAGE following tagging of the eluted 

30 proteins with iodofluorescein. The eluted proteins were visualized by laser induced 
fluorescence using a molecular dynamics flat bed imager. Using this method proteins 
that contained at least one reduced cysteine residue could be detected in the eluate at 
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<0.1 finol. Following fluorpohor labeling the eluted proteins were characterized by 
SDSPAGE and fluorography. Mixed peptide sequencing identified the eluted proteins 
as either HSP90 or ADE2. Several compounds within the small GA analog library 
have selectivity for HSP 90, while others are more selective for ADE2. 
5 Assay of purified rabbit and human ADE2 confirmed that all 

compounds that selectively eluted ADE2 from the affinity also inhibited the enzyme 
in vitro (Table 3). All assays were performed against purified human ADE2. Results 
shown are from three separate experiments. Compounds that eluted HSP90 selectively 
had no effect of ADE2 activity in vitro at uM concentrations. Significantly, 
1 0 compounds that selectively eluted HSP90 showed low biological activity in cell based 
growth inhibition assays. In contrast, compounds that showed selectivity for ADE2 
were potent inhibitors of cell growth. These findings demonstrate that in vivo, the 
biological effects of geldanamycin are because of its ability to inhibit ADE2 activity 
rather than through any actions on HSP90. 

15 
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Table 3. Determination of apparent Ki for ADE2 against GA structural analogs 





Inhibitor 

210760 


Ki-'( n M) 
6.803 


Inhibitor 

683663 


Ki"'( n M) 
37.037 


5 


189794 


3.546 


330506 


45.455 




182857 


5.525 


255109 


38.462 




189795 


0.688 


320877 


30.303 




255111 


5.051 


665479 


2.037 




189793 


3.497 


265482 


10.417 


10 


604169 


38.462 


320947 


21.277 




697886 


15.152 


182858 


58.824 




683662 


16.667 


156219 


1000.000 




672165 


1.250 


683660 


90.909 




330500 


500.000 


320946 


33.333 


15 


661581 


6.250 


330509 


21.277 




255107 


62.500 


156217 


22.222 




683201 


11.905 


169627 


7.634 




655480 


7.143 


156218 


58.824 




255104 


23.810 


359658 


1.815 


20 


682299 


13.514 


658515 


0.484 




655746 


0.688 


236651 


34.483 




682300 


18.868 


651937 


0.185 




JJ0510 


12.658 


330499 


0.615 




OftJOOO 


90.909 


683664 


1.692 




OOljOU 


1 A A f\1 

14.493 


707545 


62:500 




£ At 1 00, 
OOZ 1 yy 


55.624 


210753 


0.362 




690214 


17.544 


662.199 


50.000 




607306 


0.792 


320944 


20.000 




607307 


1.629 


236652 


10.526 


30 


255110 


22.222 


48810 


14.706 




321593 


20.833 


156216 


26.316 




674124 


13.699 


19990 


20.408 




156215 


45.455 


210761 


27.778 
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Claims: 

1 . A method of identifying bioactive compounds, said method comprising the 
steps of 

5 contacting a combinatorial library with the protein members of a proteome, to 

bind said protein members to compounds of said library, wherein the compounds of 
said library are immobilized on a solid support; 

washing the library with a buffered solution; 
releasing the bound proteins; 
1 0 characterizing the released proteins; and 

identifying the compounds of the library that bind to the released proteins. 

2. The method of claim 1 wherein the step of releasing the bound proteins 
comprises contacting the library with one or more compounds of said library. 

15 

3. The method of claim 1 wherein the step of washing the library 
comprises the steps of washing with a high ionic strength buffer and a low ionic 
strength buffer. 

20 4 The method of claim 1 wherein the compounds of said library are 

covalently bound to said solid support. 

5. The method of claim 4 wherein each of the library compounds are 
present in multiple copies that are bound to the solid support in multiple orientations. 



25 



30 



6. The method of claim 5 wherein the solid support is in particulate form, 
and the method further comprises the step of distributing equal portions of the support 
particles into a plurality of wells of a microtitre plate after the step of washing the 
immobilized compound library. 

7. The method of claim 6 wherein the step of releasing the bound proteins 
comprises adding to each microtitre plate well one or more compounds of the library. 
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8. The method of claim 1 further comprising the step of labeling the 
bound proteins after the washing step. 



5 



9. The method of claim 8 wherein the immobilized compounds are bound 
to polymer beads and the method further comprising the step of distributing single 
beads into separate wells of a microtitre plate, after the step of labeling the bound 
proteins. 



10 



30 



1 0. The method of claim 9 wherein the step of releasing the bound proteins 
comprises contacting the individual beads with a chaotropic agent. 



11. A method of identifying bioactive compounds present in a proteome, 
said method comprising the steps of 

contacting an immobilized ligand with the protein members of a proteome; 
1 5 washing the immobilized ligand with a buffered solution; 

contacting the bound proteome proteins with a target compound to release 
bound proteome proteins; 

collecting the released bound proteins; and 
determining the identity of the released proteins. 

20 

1 2. The method of claim 1 1 wherein the target compound is the same as 
the immobilized ligand. 

1 3 . The method of claim 1 1 wherein the target compound is a functional 
25 analog of the immobilized ligand. 

14. The method of claim 1 1 further comprising the step of labeling the 
bound proteins before the step of contacting the immobilized compound library with 
the individual component of the compound library. 



15. A method of identifying bioactive compounds from a poorly defined 
immobilized combinatory library, said method comprising the steps of 
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contacting a the c combinatory library with the protein members of a 
proteome, to bind said protein members to compounds of said library, wherein the 
compounds of said library are immobilized on a solid support; 

washing the library with a buffered solution; 

releasing the bound proteins; 

immobilizing a released protein on a solid support; 

contacting the immobilized released protein with compounds of said 
combinatorial library to bind components of the combinatorial library to the 
immobilized released protein; 

washing the immobilized released protein with a buffered solution; 

releasing the bound library components; and 

identifying the released library components that bind to the immobilized 
released protein. 
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STEP 1 . 

CHEMICAL 

LIBRARY LINKED 

TO SEPHAROSE/ 

AGAROSE 

VIA:- 

AMINE 

CARBOXYLIC ACID 

THIOL 

HYDROXYL 

ALDEHYDE 

PHOSPHATE 
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STEP 2. 

ADD RESINS WITH 
IMMOLIZED LIBRARY 
TO GLASS COLUMNS. 




STEP 3. 

APPLY PROTEIN 
MIXTURE. 




96 WELL TITRE 
PLATE 



STEP 5. 

DIVIDE RESIN 
OR KEEP IN 
COLUMN ADD 
OR ELUTE WITH 
INDIVIDUAL 
COMPONENTS OF 
LIBRARY. 
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STEP 6. 

ASSAY FOR PROTEIN. 




STEP 4. 

WASH WITH HIGH 
AND LOW IONIC 
STRENGTH 
BUFFERS. 



COLLECT 
FRACTIONS. 



O Q Q 
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STEP 7. 

PERFORM SDSPAGE ON 
PROTEIN POSITIVE WELLS. 



STEP 8. IDENTIFY PROTEIN BY MIXED PEPTIDE 
SEQUENCING OR MASS SPECTROMETRY. 



STEP 9. PROCEED TO DRUG EVALUATION IF LIBERATED 
PROTEIN(S) AT STEP 5 IS VALUABLE. 



FIG. 1 
SUBSTITUTE SHEET (RULE 26) 
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1. SEPHAROSE, POLYSTYRENE, ETC. 
BEAD FOR LIGAND ATTACHMENT 
0.1-0.5^01 (10-100 PER WELL). 
PLACE IN TITRE PLATE WELLS. 





2. ATTACH CHEMICAL LIBRARY- 
1-10 COMPOUND PER BEAD. 





3. TAKE 1 BEAD 
PER WELL AND 
PACK (100-10,000) 
INTO COLUMN. 



APPLY PROTEOME. 



7. ANALYZE TITRE 

2£3SS* f 0R 6. UNPACK COLUMN. 

„52Iw£L. PLACE 1 BEAD PER 

RECOVERY. WELL 





■5. WASH AND TAG 
PROTEINS (RADIOACTIVE 
E.G. 1125 OR FLUOR 
OR DYE). 



8. TAKE POSITIVE BEADS 
AND TREAT WITH SDSPAGE 
BUFFER-CHARACTERIZE BY 
SDSPAGE. 



10. RELEASE LIGAND 
FROM POSITIVE BEAD 
TO IDENTIFY LEAD 
COMPOUND(S). 

O 



9. SEQUENCE PROTEINS. 
DECIDE WHETHER THEY 
ARE IMPORTANT. 



a 



FIG. 2 
SUBSTITUTE SHEET (RULE 26) 
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STEP 1. ATTACH 0)S^ O 
LIGAND TO W( ) 

CHROMATOGRAPHY W 
BEAD 10-20 nmol/BEAD 
(E.G. ATP). PACK INTO 
CHROMATOGRAPHY 
COLUMN. 



STEP 3a. WASH WITH 
INDIVIDUAL LIBRARY 
COMPONENTS. 




STEP 2. APPLY PROTEOME WASH 
COLUMN TO REMOVE NON-SPECIFICALLY 
BOUND PROTEINS. 



STEP 3b. LABEL 
BOUND PROTEINS 
WITH 1125, FLUOR 
OR DYE. 



STEP 4a. COLLECT 
FRACTIONS. 



STEP 5. ASSAY 
FOR LIBERATED 
PROTEIN. 




STEP 4b. REMOVE 
BEADS FROM COLUMN 
PLACE 1-10 PER WELL. 
APPLY INDIVIDUAL 
LIBRARY COMPONENTS 
TO EACH WELL 



STEP 6. TAKE POSITIVE BEADS 
AND TREAT WITH SDSPAGE 
BUFFER-CHARACTERIZE BY 
SPSPAGE. 

STEP 7. SEQUENCE PROTEINS 
DECIDE WHETHER THEY ARE 
IMPORTANT. 



FIG. 3 
SUBSTITUTE SHEET (RULE 26) 
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1 . TARGET CELL TYPE CULTURED IN 
96 OR OTHER MULTIWELL DISH. 

2. APPLY COMBINITORIAL PEPTIDES 
(EITHER INDIVIDUAL OR IN MULTIPLES 
OF 10-1000). PEPTIDES ATTACHED TO 
FLUOR OR DISPLAYED ON FUSION 
PROTEIN. 

3. IF NOT ALREADY INCORPORATED 
INTO PEPTIDE ADD FLUOR TAGED 
ANTIBODY AND SCREEN FOR SPECIFIC 
CELL SURFACE BINDING. RESCREEN 
PEPTIDES IF MORE THAN ONE PEPTIDE 
PER WELL USED, NARROW TO ONE 
PEPTIDE OR CLONE. 

4. RECOVER PEPTIDE FROM ORIGINAL STOCK WELLS OR 
CULTURE ORGANISMS PRODUCING FUSION PROTEIN. 




5. ISOLATE AND SEQUENCE 
SYNTHETIC PEPTIDE BY MASS 
SPECTROMETRY OR EDMAN 
SEQUENCING OR SEQUENCE 
DNA OF FUSION PROTEIN. 



6. DETERMINE BINDING CONSTANT 
OF POSITIVE PEPTIDES. ENSURE THAT 
THEY ARE PHARMACOLOGICALLY 
RELEVANT <10|J.M. 



CELL EXTRACT 
THEN WASH. 




7. SYNTHESIZE RELEVANT PEPTIDES. CONSTRUCT 
AFFINITY RESIN. APPLY CELL EXTRACT PREPARED 
FROM TARGET CELL. WASH LOW HIGH IONIC 
STRENGTH BUFFERS TO REMOVE NON-SPECIFICALLY 
BOUND PROTEINS. 

PURE PEPTIDE OR 
SDSPAGE SAMPLE 
BUFFER. 

8. ELUTE BOUND PROTEINS 
WITH EXCESS PEPTIDE, 
CHARACTERIZE BY 
SDSPAGE, OR BOIL BEADS 
IN SDSPAGE SAMPLE 
BUFFER. 



SDSPAGE 





FIG. 4 



9. IDENTIFY BY 
PROTEIN 
SEQUENCING 
BOUND OR 
ELUTED PROTEINS 
FROM GEL. 



SUBSTITUTE SHEET (RULE 26) 



10. DECESION VALUABLE 
PROTEIN TARGET. 



YES 




PROCEED TO FULL CELLULAR 
CHARACTERIZATION OF CELL 
SURFACE RECEPTOR. 
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SEQUENCE LISTING 



<110> Haystead, Timothy A 

University of Virginia Patent Foundation 

<120> Proteome Mining 

<130> 00164-02 

<140> 
<141> 

<160> 4 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 13 
<212> PRT 

<213> Oryctolagus cuniculus 
<400> 1 

Met Phe Phe Lys Asp Asp Ala Asn Asn Asp Pro Gin Trp 
1 5 10 



<210> 2 
<211> 13 
<212> PRT 

<213> Oryctolagus cuniculus 
<400> 2 

Met Lys He Glu Phe Gly Val Asp Val Thr Thr Lys Glu 
1 5 10 



<210> 3 
<211> 9 
"<212> PRT 

<213> Oryctolagus cuniculus 
<400> 3 

Met Thr Lys Ala Asp Leu He Asn Asn 
1 5 
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<210> 4 

<211> 10 ~ 2 ' 
<212> PRT 

<213> Orycfolagus cuniculus 
<400> 4 

Met He Gly Gin Phe Gly Val Gly Phe Tyr 
1 5 10 
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.m 

HOBBIE COPBEHTbl RJlfi AO<E>HHHOH XPOMATOrPAOHH - | 
nPOH3BOflHWE OOC^OAMHflOB yPHflHHOBMX HYKJIEOTHAOB 

MuGaee B. JT., Kycoe K). JO., Ra^unuyn JJ. A., 
Konemuoe H. K. 

HHcmumym opaanimecKoti. xumuu um. H. fl. Scauhckozo 
AnadeMUu nayn CCCP, Mocnea 

B3aHM0floficTBHCM N^yprAHH-5'-()i>ocr|)opHjj)- h NHypHAHH-5'-nHpo<J>oc(}>opHJi)-l ,6- 
AHaMHHoreKcaHOB c aKTHBKpoBUHHOH OpoMi^HaHOM ce<j>apo30H no/ryneHH 6FocneAH<j)iPiec- 11 
KHe cop6eHTM ajih <J)epMeHTOP, B3aHMOAeHCTByK>iimx c ypuflUHCOAepHcamnMH HyiuieoTH- J| 

AaMH. Cop6eHTM COflepWaT HMM06lIJIH3OBaHHIJe OCTaTKH UMP H UDP, CBH3aHHHe c 5| 
MdTptll^eH (j)0C({jOaMHAHOn CBH3BH). * | 

A(j)$HHHaH xpoMaTorpa$HH c Hcnojib30BaHH6M aACop6eHTOB, coAep>Kam;HX % 
HMMo6ajiH30BaHHHe HyKjieoTHAW, nojiyqHJia b nocjieamie vojyu innpoKoe pac- | 
npocTpaHenne rjih ohhctkh pa3Hoo6pa3Hwx ^epMeHTOB [1, 2J. OcoSohho J 
Sojibraoe BHHMaHne yAejinjiocb nojiy^eHHio aAcop6eHTOB — npoH3BOAHMx A 
aAeHHHOBux HyKJieoTHAOB, b to BpeMH Kan nojiy^eHHe HocnTejien c hmmoSh- 

JIH30BaHHHMH ypHAHHOBblMH HyKJieOTHAaMH, KOTOpbie, HO-BHAHMOMy, MOryT 

HaaTH mapoKoe npHMeHOHHe npn o^HCTKe 6o;ibinoH rpynnH $epMeHTOB, 
B3aHMOAeHCTByK>miix c ypHAHHC0Aep>Kam;HMH h y k ji eoTHAaMH , H3yneHo b 3Ha- 
qnTejibHO MeHbinefi CTeneHH. B onncaHHbix b jiHTepaType npHMepax hmmoSh- 
jiH3aii;HH npoH3BOAHHx UMP Ha arapo3Hbix MaTpHijax ocymecTBjiHJiacb 
nepe3 ocTaTOK ph6o3h nocjie nepnoAaTHoro okhcjighhh ijnc-rjiHKOJibHOH 
rpynnnpoBKH [3—5] hjih nepe3 ociaTOK <J>oc$opHOH khcjioth c ncmombio 
n-aMHHo$eHHJioBwx 3$npoB [6, 7], a HMMo6HJiH3aijHH UDP — ^epe3 ocTaTOK 
Ph6o3h [4] hjih p-$oc$axa c noMombio 6-aMHHoreKCHJioBoro a^npa [8]. 
HeAaBHo npeAJionceH hobhh motoa HMMo6njiH3au;HH HyKJieoTHAOB Ha hoch- 

TeJIHX, OCHOBaHHblH Ha B3aHMOAeHCTBHH CMeniaHHHX aHrHApHAOB HyKJieOTH 

AOB h Me3HTHJieHKap6oHOBofi khcjioth c NH 2 -rpynnoH aMHHoreKCHJice$apo- 
3H [9, 10]. C noMombio 3Toro MeTOAa nojiyqeHbi aACop6eHTbi, CBH3aHHbie c pa3- 

JIHHHblMH MOHO- H OJIHrOHyKJieOTHAaMH, a TaKJKG HyKJieOTHAnOJIH$OC(|)aTaMH, 

oAHauo aAcopSeHTH, coAep^Kamne npoH3BOAHbie ypHAHHo bh x HyKJieoTHAOB, 
npnroTOBjieHbi He 6hjih. 

HeAOCxaTOK 3Toro cnoco6a cHHTe3a a<|)<f>HHHbix aACopSeHTOB coctoht b co- 
xpaHeHHH b nojiyneHHOH Maipnije He3aMemeHHbix aMHHorpynn, *ito MOH<eT 

npHBOAHTb K nOHBJI6HHIO y COp6eHTa H0H006MeHHHX CBOHCTB H OCJIOJKHe- 

hhhm npn 6HOcneu|H$HHecKOH xpoMaTorpa$HH [2]. MHoroo6em;aiomHM KameT- 
ch MeTOA, OCHOBaHHblH Ha B3aHMOAeHCTBHH jiHraHAOB, coAepjKau^Hx cboSoa- 
Hyio aMHHorpynny, c aKTHBHpOBaHHon 6poMijHaHOM ce$apo30H. 

B HacTonmen pafioTe mm cooSni.aeM 0 nojiyneHHH Ha ocHOBe ce$apo3bi 

aAC0p6eHT0B, COAep>Ka^HX HMM06HJIH30BaHHbie OCTaTKH UMP H UDP, 

KOTopbie npncoeAHHeHH k MaTpnq;e nepe3 ocTaTOK $oc$aTa c noMombio $oc- 
120 
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)eHTOB, coAepncamux 
s roAbi innpoKoe pac- 
b [1, 21. Oco6eHHO 

TOB — IipOH3BOflHbIX 

[ocHTejiea c hmmo6h- 
ao-BHAHMOMy, MoryT 
rpynnu $epMeHTOB, 

3MH, H3yneHO B 3Ha~ 

e npHMepax hmmo6h- 
ax ocymecTBjiHJiacb 

HH IJHC-rjIHKOJIbHOH 

HCJIOTH c noMom,bio 
DP — qepe3 ocxaTOK 
DBoro 9$Hpa [8]. 
yKjieoTHAOB Ha hoch- 
thaphaob HyKJieOTH 
aMHHoreKCHJice$apo- 
w, cBH3aHHHe c pa3- 

)THAHOJIH({)OC(|)aTaMH, 
IOBHX HyKJieOTHAOB, 

5eHTOB COCTOHT B CO- 

lorpynn, hto mojkgt 

CBOHCTB U OCJIO>KHe- 

:>o6emaiomHM KaH<eT- 
•OAepraamnx cbo6oa~ 

()ap030H. 

ra ocHOBe ce<J)apo3H 
kh UMP h UDP, 
aia c noMombio $oc- 



AoaMHAHofi CBH3H [41. Hx nojiyqerae H3 cooTBeTCTByiomux nyKJieoTHAOB (I 
E^aeT b ce6a npeBpameHne nocaeAHHx b CMemaHHBie aHrnAPHflH (II) 
f aareM b *oc<t>oaMHAbi - nponaBOAHbie reKcaMeiHJieHAHaMHHa (III), koto- 
pbte AaJiee H MMo6HJiH3yiOTCH Ha CNBr-ce$apo3e: 
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KoHneHcanHfl UDP (16) c xjiopaHrnflpHflOM M e3HxnjieHKap6oHOB0H khcjio- 
th (MesCOCl) npoBOflHJiacb b ycnoBHHX, ncnojibaoBaHHHX panee ajih cmrreaa 
CMBinaHHHx aHrHApH«0B HI, 12]. AaajiHa peaKi^HOHHOH CMecn c no moijbio 
xpoMatorpa^HH h 9JieKTpo<pope3a na 6y M are noKaaaa npHcyrcxBne AByx rjiaB- 
hhx HyK^eoTHAHHx npoAynxoB (Ila) h (116), o6 P a 3 yK>ia.HxcH b nv^nsvL- 
xejibiio paBHUx KOJiH^ecxBax, HeopraHanecKoro (J>oc$axa h He6ojibmHx ko- 

^llxox peat "bxaJvKaaHBaeT Ha 3HawrejibHoe pacmenjieHHe nH po(poc4>aTHoii 
CBH3H HVKJieoTHfla b ycjioBHHx peaKUHH. B onncaHHHx pa H ee npHMepax 
no™eZl CMemaHHHx aHr H fl P H«0B ADP h GDP xanoro Pac£eu«n» 
nnpoAocpaxHOH cbh3H c o6pa30BaHH6M upohbboahhx xana (Ila) He Ha6jno- 
sajiocb, xoxh 6hjio oxMeieHO noaBJieHne neopraHHiecKoro $oc*axa (-15 /o) 
noH B3aHMoneHCXBHH MesCOCl c ATP [9]. • _ 

riponvKxbi pea^HH 6ujih paanejieHH c noMombio hohooomchhoh xpoMa- 
TorpaiHH na DEAE-uejuiiOJiose; cMeiuaHHbie aHr H fl P H«bi (Ila) n (116) 6hjih 
BbmejieHH c BbixonaMH 39 h 37% cooxBexcxBeHHo (cm. xa6ji H u.y). 

HneHXHTOoexb YO-cneKxpoB hcxoahhx h nojiyneHHbix coefliraeHHH 
cBHAexejibcxsyex 06 oxcyxcxBHH H3Me H eH H fi rexepoHHKJiH^ecKoro ocHOBaHHH 
BejiHiHHH oxHonieHHH HyKJieoanfl: <J>oc<pop noKasuBaroT, hto (Ila) hbjih6xch 

np0H3B0flHbIM UMP, a (116) - np0 H 3B0«HHM UDP. n P H KHCJIOXHOM r H fl P 0- 

jiiise aia) h (116) o6pa3yexca M e3HXHJieHKap6oHOBafl Kncaoxa h cooxBexciByio- 
m H e HyKJieoTHHbi. B cnenxpe nMP hmcioxch c H r H aabi xapaKxepHHe n«H 
ocxaxKa Me3HXHJieHKap6oHOBoii khcjioth h yp H « H Ha. CoBOKynnocxb Bcex 

3XHX AaKTOB n03B0JIfleX 0flH03HaiH0 npHHHCaXb COeflHHeHHHM (Ila) H (110) 

cxpyKxypbi CMemaHHHx a H rH3pHflOB M e3HXHJieHKap6oHOBOH khcjioxh c y P H- 

H0H-5'-MOHO- H flH4)0C$aT0M COOTBeXCXBeHHO. 

BaaHMoaeiicxBHe (la) c xjiopaHrH3pH«oM M e3HXHJieHKap6oHOBOH khcjioth 
rjiaflKO np H BOflnx k cMemaHHOMy a H rH W nfly (Ila), koxophh noc^e ohhctkh 
noHoo6MeHHoii xpoMaTorpa<p H eH no^yneH c bbixoaom bd A. 

fljifl noJivneHHH HyKJieoxnflHHx jiHraHflOB co cbo6oahoh aMHHorpynnoH 
CMemaHHHe aarHflpHflH (Ila) a (116) bboahjih b pea^Hio c reKcaMex^eHflH- 
aMUHOM, HcnojibayH 20-30-KpaxHbifl H3 6hxok nocjieflHero «flH npe fl OTBpame- 

HHH B03M 05 KHOrO 06 P a3OBaHHH CHMMexpHIHHX 6HC-(i.0Cj0aM H flHbIX HP0H3- 

boahhx. PeaKAHH aaKan^Baaacb 3a 5-7Hnp H 20° h aa 20 h np H 5 (Docjo- 
qjinn (Ilia), npoflyKx KOHAenca^n cMeniaHHoro anrHApHAa (Ha) c rencaMe- 
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Coe^HHeHHe 


B CHCTeMaX 


fl UMP 
b Gytjjepe B 


OTHomeHHe ypHnnn : o<5- 

njHtt <J)OC<j)Op : KHCJIOTO- 

Tf O ^ TF ITI M IT gtlt\f* A\t\l\ * 

jlttu HJl b HblH (pUUtJJOp . 

: aMHHorpynna 


A 


B 


(Ila) 
(H6) 
("la) 
("16) 


0,80 
0,68 
0,47 
0,29 


0,78 
0,64 
0,28 
0,18 


0,60 
0,85 
0,35 
0,65 


1,00 : 1,03 : 0,00- 
i,00 : 2,15 :0,99- 
1,00 : 1,06 : 0,00 : 1,09 
1,00 : 2,14 : 1,07 : 1,07 



THJieHflnaMHHOM, BHflejieH H3 peaKn;HOHHOH cmgch hohooSmghhoh xpoMaTorpa- 
$hgh Ha aaysKce 1x8: nonHTKa Hcnojib30BaHHH Sojigg cjiaSoro aHHOHHTa — j 
DEAE-n;ejiJiiojio3H — 6mia HeycnemHoa. nocjie o6eccojiHBaHHH rejib-^njib- j 
TpaijHeH Ha CG$aA6KCG G-15 bhxoa (I I la) cocTaBHJi 63%. 

CTpyKTypa cogahhghhh (Ilia) onpeaejieHa othoihghhgm HyKJieo3HA : $oc- 
<$op : aMHHorpynna (cm. TaSjinijy), a TaK>Ke AaHHbiMH KHCJioraoro rHApojiH3a, '] 
b pe3yjibTaie KOToporo H3 <J)oc$oaMHAa (Ilia) o6pa3yeTCH reKcaMGrajiGHAH-,* 
aMHH h UMP. ?1 

fljia BHflejieHHH $oc$oaMHffa (III6) H3 peaKHjHOHHOH cmgch b KanecTBe> 
HOHooSMeHHHKa HCnojib30BaHa DEAE-u;ejiJiK)Jio3a; cogahhghhg (III6) no- : 

Jiy^ieHO B BHfle TpH3THJiaMM0HHGB0H COJIH C BHXOflOM 53%. AHaJIHTH*IGCKH6 1 

AaHHHe (cm. Ta6jiHijy), a TaiOKG chgktp IIMP noATB6p>KAaiOT ero cipyKTypy. 

Ooc$oaMHAH (Ilia) h (III6) MoryT 6brrb nojiyHGHH, hcxoah H3 UDP, 
6e3 npoMe>KyTOTOoro BbiAGJieHHH CMemaHHbix aHrHApHAOB (Ila) h (116). 
B TaKOM Bapname <£oc<j)oaMHA (1 1 1 6) bhagjigh hohooSmghhoh xpoMaTorpa$Hea ; 
Ha DEAE-jjejuiioJiose c bhxoaom 31%; no-BHAHMOMy, <£oc$oaMHA (IIIaj : 

MO>KGT 6bITb BHAGJIGH H3 HpOMHBHHX BOA HOHOOSMGHHOH XpOMaTOrpa$H6H Ha : 

AayaKCG 1x8. IIo ji yHGHHBiS b oAHy CTaAHio npcnapaT (III6) coAGp>Kaji hg- 
6ojibrao6 kojihhgctbo hphmgch, KOiopan 6biJia OTAGjiGHa npenapaTHBHbiM ; 
3JiGKTpo$opG30M Ha 6yMaro. 

IIojiyHeHHG aACopSGHTOB a^h a$$HHHOH xpoMaTorpa$HH (IVa) h (IV6) 

eCymeCTBJIGHO B3aHM0A6HCTBH6M $OC(j>OaMHAOB (Ilia) H (III6) C aKTHBHpO- 
BaHHOH 6pOMIJHaHOM CG$ap030H 4B. J\llK aKTHBaUiHH HOCHTGJIH npHMGHGHa 
MOAH$HKa^H HGAaBHO OHHCaHHOH MGTOAHKH aKTHBaiI,HH CG<|>ap03H B IIJGJIOH- 

hom $oc<f>aTHOM 6y<|)6p6 [13], cocTonman b 3aMeHe BOAHoro pacTBopa 6poM- 
ipiaHa pacTBopoM b aijGTOHHTpHJiG. 3tot BapnaHT aKTHBaijHH rejin OKa3aJica 

60JIG6 3(|)(j)GKTHBHblM, HGM aKTHBaiJHH BIJGTOHHTpHJIbHblM paCTBOpOM SpOMIJHa- 

Ha B KapSoHaTHOM 6y$epG [14] *, h no3Bojinji noJiy^aTb bochpoh3boahmhg 

pG3yjIbTaTH. 

KOHAGHOaiJHH $OC$OaMHAOB (Ilia) H (III6) C aKTHBHpOBaHHOH SpoMijna- 

hom ce(|)apo30H 4B npoBeAGHa npn pH 8,3. OcTa-roTObie aKTHBHbio ijghtph 

COpSGHTa yAaJIHJIH 06pa60TK0H 3TBHOJiaMHHOM. B yCJIOBHHX pGBKIJHH cymecr- 
BGHHOrO THApOJIH3a $OC$OaMHAOB HG npOHCXOAHJIO, ^TO H03BOJIHJIO HX HCHOJIb- 
30BaTb A^H HOBTOpHOH KOHAGHCaiJHH. BblJIH HOJiyHGHH aACOpSGHTBI (IVa) 

h (IV6), coAGpTKanjHG 4,8 mkmojib $oc$oaMHAa (Ilia) h 5,2 mkmojib cogah- 

HGHHH (III6) Ha 1 MJI TGJIK COOTBGTCTBGHHO. 

IIoJiyqGHHHG aACOp6GHTH OKa3aJIHCb yCTOHTOBHMH B yCJIOBHHX a$$HHHOH 

xpoMaTorpa$HH npn pH 8,1—8,5 h npn xpaHGHHH b HGHTpajibHbix pacTBopax 
npa noHHH^GHHOH TGMnGpaTypGj TaK, HanpHMGp, 3a 30—40 ahgh npn 4° b pac- 
TBope 1 M xjiopncToro HaTpnH CTGHGHb agcop6i^hh yO~norjiou;aiom6ro 
MaTGpnajia b cjiyqao aACop6GHTa (IVa) cocTaBHJia mghgg 10%. 

Ctghghb HMMo6njiH3ai^HH nojiy^GHHoro ypHAHH$oc$oaMHAa cpaBHHMa 

C JIHTGpaTypHblMH AaHHHMH HO HMMo6HJIH3aK,HH ypHAHHOBHX HyKJIGOTHAOB 
(|)OC(|)03$HpHOH CBH3BK) C HOMOD^bK) 7l-aMHHO$6HHJIOBbIX 3$HpOB [6, 7] HJIH 

b cjiy^ao HHpo$oc(|)aTOB c noMombio 6-aMHHoreKCHJiOBoro 3$npa [8]. 



* IlpH aKTHBaAHH rejiH b Kap6oHaTHOM 6yAepe Ao6aBjieHH6M pacTBopa 6poMnnaHa b ■# 
aii,eTO HMTpHa€ , rfaK 3to onHcaHO b pa6oTe [14], deneHb HMMo6HJiH3anHH AocAoaMHnoB 
He npeBMniajia 0,5—0,8 mkmojib/mji rejia. 
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Trkhm o6pa30M, onHcaHHbifi b HacxoHmeH pa6oxe mcxoa nojiyneHHH cop- 

6eHTOB C HMM06HJIH30B8HHHMH y P HAHHOBHMH HyK«eOTHflaM0, CBHSaHHHMH 
Aoc&oaMHAHOH cBH3bio c MaxpHneii, nOSBOJIHOX npOCTO H C BHCOKOH 9*<peR> 
THBHOCTbH) OCymeCTBHTb npHCOeffHHeHHe 0CX3XK0B HyKJI eOTHflOB k MaxpHne, 

H e coAepwamea cbo6oahhx aMHHorpynn. nojiyienHHe a«cop6e H TH MoryT 
oKasaTbCH nojiesHUMH npn BH«ejieHHH h hcc ji 6ao b aHHH (pepMCHTOB, npoHB- 
nHKumx cpoACTBo k ypHflHHOBHM HyKJieoTHflaM. Hanoi npeABapirreJibHHe 

OnHTH CBHfl6TeJIbCTByH)T 0 BHCOKOH 3$$6KXHBH0CXH COp6eHTa (lVa) npH OIHCT- 

K e AepMeHTa, Kaxajmanpyiomero nepenoc ocxaxKa ranaKxo3HJi*oc<i>aia c ypn- 
nHH*oc*aTrajiaKT03H H a nojranpeHOJi$oc$aT npn SnocHHTeae 0-cnenH$irae- 
cKoro nonHcaxapHfla cajibMOHejui. IlpHMCHeHHbiH b «a H HOH pa6oxe uoaxoa, 
oMeBHAHO, Mowex 6uTb pacnpodpaHeH h Ha Apyrne HyKJieoxHAH. 

3KcnepHM6HTajibHaH iacTb 

B pa6o T e HcnojibSOBajm UMP h UDP (Reanal, BHP), xpH-K-OKTHJiaMHH 
(Schuchardt, <DPr), ce$afleKC G-15 h cefcaposy 4B (Pharmacia, HIbbi^hh), 
DEAE-i;enJiK)Jio3y (Whatman, Ahi-jiba). _ 

AHajiHTHiecKyro xpoMaxorpa<pHio h 3jieKxpo<pope3 iipoboahjih Ha oy M are 
FN-11 (Filtrak, r^P), a npenapaxnBHyio — Ha Whatman 3 MM (AHrjiHH). 
npH BX HcnojibaoBajiH chcxcmh: A - sxbhoji - 1 M anexax aMMOHHH, 
5-2, pH 7,5; B - naonponaHOJi - aMMHaK - BOAa, 7:1:2; 3JieKxpo$o- 
pea npoBOAHJia b 0,05 M T pH3XHJiaMMOHHH6HKap6oHaxHOM oyjepe, pii Ifi 
(6v(J)ep B) Ha npn6ope nB80>-l npn rpaAHekxe HanpHweHHH lb B/cm. 

0)oc4>opcoAep5Kam;He a BemecTBa c nepBHTOOH aMHHorpynnoH o6Hapy- 
WHBajiH Ha xpoMaTorpaMMax h 3JieKTpo4>operpaMMax c noMombio peare H xa 
XaHHca-HinepByAa h pacxBo'pa HHHrHApHHa cooxBexcxBemio (nponncH 
D45a h D112a b [15]). KojiHiecxBeHHoe onpeAejiemie KHCJioxojiaonjibHoro 
H o6mero Aoc<popa hpoboahjih no MexoAaM pa6o T [16] h [17] cootb6tctb6hho» 
nepBHiHOH aMHHorpynnw - pacxBopoM xp0HHxpo6eH3OJicyjib(poHaxa UoJ 

C HCHOJIb30BaHHeM A«H KajIHSpOBKH 3XaHO JI BMHHa . 

nHPHAHH 1 a6cOJIH)THpOBaJIH KHHH^eHHeM H8A P 2 0 5 , aneTOHHTpHJI — H8A 

CaH„ 6e H 30Ji - HaA HaxpneM h 3xa H oji - hba MarHHeM; bco paciBopHxejin 
neperoHHJiH nepeA ynoxpe6jieHHeM. XjiopanmAPHA M63HXHJi8HK8p00H0B0H 
khcjioxh (MesCOCl) h 6poMUiiaH nojiynajin no pnncaHHHM : MexoflHicaM liyj 

" ^CneKT^bi B r^? e cHHMajiH . b 2 0 «a np H 6ope WP-60 (Brute, <DPD c na- 
KonneHneM b HMnyjibCHOM pe* HM e (10 npoxo^AeHnn) c noAaBjienneM chx- 
Hana pacxBopnxejiH (AHOKcan b KaiecxBe BnyxpeHHerocxaHAapxa); y<P-cneK- 
Tpw CHHMajiH Ha cn6KTpo$OTOM6Tpe Unicam SP-8000 (Ahi-jim). 

KoHdencauuii xAopamudpuda Me3umuAeHKap6oHoeou kucaothu c uur. 
Boahhh pacTBop yp H AHH-5'-AH4>oc(paTa (0,3 mmojh,) riponycKa^n nepea ko- 
jiOHKy (I X 5 cm) c AayaKCOM 50 (H+-<pop M a), HeHT P ariH30BaJiH hephahhom 
ii ynapUajiH AO 1—2 mji. K ocxaxKy AoSaBjiHJiH 0,45 MMOjib x P h-k-okthji- 
a'MHHa b 1,5 mh nnpHAHHa, BCxpHXHBajia ao roMoreHHoexn, pacxBop ynapHBann 
a ocxaxoK xmaxejibHO BbicyniHBajiH MHoroKpaxHofi oxroHKOH co CMecbio aec. 
C.H.- a6c. EtOH (1:1). nojiyHeHHHii ocxaxoK pacxBopaaH b 5 . mji cmcch 
a6c C,H 5 N - a6c. CH 3 CN (3 : 1), Ao6a B jiHJiH 3,74 mmojib (0,6 mji) MesCOCl, 
3HeprHHH0 BcxpaxHBajiH b xeieHH6 8 mhh n P H 20°, «o6aMHaH_12 MJI boah, 
MesCOOH aKCxparnpoBaJiH 3<ph P om (6 X 5 m«); 3(p H pHbiH cjioh npoMHBajin 

BOAOH (1X5 MJI). 06BeAHHeHHHH BOAHUH paCXBOp pa36aBJIHJIH BOAOH AO 

300 mji a H8H0CHJIH Ha KOJioHKy (3 X 24 cm) c DEAE-nejuiiojio30H 
(HCO,--4o P Ma); KOJioHKy npoMUBajin boaoh ao oxcyxcTBHH norjiomeHHH npn 
260 hm (390 mji) h 3JiibHpdBajiH jihhohhhm rpaAHeHxoM .KonneHX-panHH xpn- 
3x H jiaMMOHHfi6 H Kap6oHaxa (TEAB) ox 0 ao 0,4 M (no I n). <D P aKH, HH , coAep- 
JKamHe yO>-norjioma»)mHH Maxepnaji, o6T>eAHHHJiH h jiH0$HJiH3OBajiH c AooaB- 
jieHHeM boah (5 X 10 mji) ajih yAaJieHna TEAB. BHX 0 0 Q V P ^ WM TnS BO « 
cojih (Ha) VrcnpyexcH 0,07 M TEAB) cocxaBHJi 39% (117 mkmojib), a 
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rpH^THJiaMMOHHGBOH cojih (116) (ajuoapyeTCH 0,14 M TEAB) — 37% 
(111 mkmojib). XpoMaTorpa^H^ecKne h 9JieKTpo$opeTHHecKHe iio^bh>khocth, 
a T&K?Ke aHajiHTH^ecKne ^ammo npHBeaeHH b Ta6ji*me. 

Ilpn raApojiH3e (0,1 h. HC1, 100°, 2 h) aHriiflpHflOB (Ha) h (116), no AaH : 
hum BX h 9JieKTpo$ope3a, o6pa3yK)TCH MesCOOH, UMP, a TaiOKe UDP 
b cjiynae (116). CneKTp IIMP aMMomteBOH cojih (116) (8, m. a.): 7,69 a (6-H 
/ 5 , 6 8 Tii;); 6,74 c (npoTOHH 6eH30JibHoro HApa); 5,84 m (5-H + l'-H); 2,15 c 
a 2,08 c (3 CH 3 ). 

CMeuiannuu amudpud Me3umujienKap6oHQeou nucjiombi c UMP {I I a). 
Tpn-w-OKTHJiaMMOHHeByio cojib UMP (0,6 mmojib), nojiyneHHyio no BHrneoira* 
caHHofi MOTOAHKe, TmaTejibHo BHcymHBajiH, pacTBopHJiH b 10 mji a6c. nnpH- 
AHHa h o6pa6aTMBajiH 7,84 mmojib (1,2 mji) MesCOCl b ycjioBnax, anajiofHH- 
hhx nojiy^eHHio (116). Bhxoa anrHApnAa (Ila) cocTaBHJi 63% (378 MKMOJib). 
AHajiHTH^ecKne AaHHHe ajih cbefliraeHHH (Ila), a TaK>Ke ero xpoMaTorpa$n^ie- 

CKHe H 3JieKTpO$OpeTHHeCKHe XapaKTepHCTHKH npHBOfleHH B Ta6jIHD[e. 

N^iypudun-S' -$oc$opuA)-l fi-duaMUHoeencan (Ilia). K pacTBopy 
0,19 mmojib (Ila) b 2 mji boam npa 20° npH6aBjiHJiH no KanjiHM b Te^emie 
10 mhh 6 MMOJib cBe>KeneperHaHHoi?o reKCaMeTHJieHAHaMHHa b 1,5 mji boah. 
tlepea 20 h npn 4° peaKijHOHHyio CMecb pa36aBjiHJin ao 1 Ji, HaHOCHJIH Ha ko- 
jioHKy (3 X 25 cm) c DEAE-n;ejiJiK)ji030H (HC0 3 ~-$opMa) h npoMHBajin boaoh' 
(200 mji). SjiionpoBajiH jiHBtefiHHM rpaAHeHTOM pacTBopa TEAB ot 0 ao 
0,5 M no 1 ji. Ochobhog BenjecTBo (~74%) oKa3ajiocb b dpomhbIhhx BOAax, 
b to BpeMH KaK b nepBon $paKn;HH — Bcero okojio 12% coeAHHeHHH (Ilia). 
Bo BTopofi $paKi^HH o6Hapy>KeHO okojio 8% ncxoAHoro UMP. IIpoMHBHHe 1 

BOAH H n6p By 10 $paKH,HK) 06l>eAHHHjlH H HaHOCHJIH Ha KOJIOHKy (1,5 X 18 cm) 

c AayaKCOM 1x8 (HC0 3 "-$opMa) ripn 4—10°. KojiOHky npoMMBajin boaoh 
(500 mji) h 3jiK>HpoBajiH BemiecTBa c kojiohkh pacTBopaiviH TEAB: 0,05 
(200 mji), 0,075 (200 mji), 0,1 (500 mji), 0,125 (250 mji) h 1 M (110 mji). OcHOBHoe 
KOJin^ecTBo BemecTBa (93%) CMHBajiocb c kojiohkh 0,1 M TEAB, 3Ty $paK- 

UHH) MHOrOKpaTHO JIHO$HJIH30BaJIH c Ao6aBJieHH6M boah AJi« yAajieHHH TEAB. 
OcTaTOK paCTBOpHJIH B 2,5 MJI BOAH H o6eCCOJIHBaJIH Ha K0JI0HK6 (1 ,5 X 75 cm) 

p ce$aAeKC0M G-15; <J>paKH.Hio, coAepncamyio (Ilia) (63%), jiH0<f>HJiH30BajiH. 
AiiajiHTHHecKHH o6pa3ei^ (Ilia) (cm. TaCJjraijy) nojiynajiH npenaparaBHHM 
pjieKxpo(|)ope30M Ha 6yMare Whatman 3 MM b 6y$epe B. Ilpn khcjiothom 
mApojiH3e coeAHHeHHH (Ilia) (0,1 h. HC1, 100°, 2 n), no AaHHHM BX h 
•3jieKTpo$ope3a, Ha6jiK)Aajiocb o6pa30BaHHe reKCaMeTHJieHAHaMHHa h UMP. 

N^iypudun-S' ' -nupocf)OC(f)opuji)-l fi-duaMUHozencan (1 1 16). A. K pacT- 
Bopy 55 MKMOJib (116) b 0,5 mji boah AoSaBjiajin no KanjiHM b TeneHne 10 mhh 
pacTBop 1 mmojib cBeJKenepernaHHoro reKCaMeTHJieHAHaMHHa b 0,5 mji boah. 
*Iepe3 20 /q npn 20° peaKijHOHHyio CMecb pa36aBjiajiH boaoh ao 500 mji 
h HaHOCHJIH Ha KOJioHKy (3 X 23 cm) c DEAE-ijejijiiojio30H (HC0 3 ~-$opMa); 
KojiOHky npoMHBaJiH boaoh (300 mji) h jihhohhhm rpaAHeHTOM TEAB 
pH 7,5, ot 0 ao 0,36 M (no 1 ji). OpaKijHH, coAGpjKan^HB yO-norjiomaiomne 
BemedBa, 06'beAHHajiH h MHoroKpaTHO JiHo$HJiH30BajiH c Ao6aBjieHneM 
boah. Bhxoa (III6) cocTaBHJi 29 MKMpjib (53%). AnajiHTHHecKHH o6pa3eD[ 
(cm. Ta6jiHn,y) nojiynen npenaparaBHOH BX b CHCTeMe A. CneKTp IIMP aMr 
MOHHeBofi cojih (III6) (6, m. a-): 7,89 a (6-H, / 5 , 8 8 T^; 5,89 m (5-H + l'H); 
2,87 t (CH 2 npn C e reKCaMeTHJieHAHaMHHa) ; 1,245; 1,146; 1,02 h 0,92 (5 CH 2 , 
npn C(d— C( 5 ) reKCaMeTHJieHAHaMHHa). 

B. 0,6 MMOJib TpH-H-OKTHJiaMMOHHeBOH COJIH UDP paCTBOpHJIH B CMeCH 

9 mji a6c. nnpHAHHa h 1,5 mji a6c. aijeTOHHTpnjia,. Ao6aBjiajiH 1,2 mji 
(7,84 MMOJib) MesCOCl h aHeprnHHO BCTpaxnBaJiH 8 mhh npn 20°. flo6aBjiajiH 
12 mji boah, 3KCTpan?poBajiH 3(j)HpoM (5 X 10 mji). B boahhh cjioh, ynapeH- 
huh ao 1 mji, Ao6aBjiajiH b Te^eHHe 20 mhh 6 MMOJib CBeH^eneperiiaHHoro reK- 
CaMeTHJieHAHaMHHa b 1,2 mji boah. Hepes 20 ^ npn 3° peaKi^HOHHyio QMecb 
pa36aBjiajiH boaoh ao 1 ji h HaHOCHJIH Ha KOJioHKy (2,5 X 30 cm) c DEAE- 
u;ejiJiio^i030H (HC0 3 T-$opMa); KOJioHKy npoMtiBajiH boaoh (500 mji) h jihhch- 
hhm rpaAneHTOM TEAB ot 0 ao 0,4 M no 1 ji. OpaKipH, coAepjKamne YO-no- 
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rjionjaK>m,He nemecTBa, o6i>eflHHflJiH h MHoroKpaTHo jiHO$HJiK30BajiH c flo- 
6aBJieHHeM boam. Bhxo« (III6) — 184 mkmojib (31%). 

Anmueai^UH cecpapo3bi 4B 6poMi^uanoM t K 1 mji ce$apo3H 4B floSaBjiajiH 
i MJi 5 M $oc$aTHoro 6y$epa, pH 11,9, nojiyneHHyio cycneH3HK> oxjiarcaaJiH 
20 4° h Ao6aBJiHJiH npH nepeMemnBaHira pacTBop SpoMi^nana (100 Mr) b aije- 
TOHHTpnjie (0,1 mji). Hepe3 10—15 mhh nepeMeniHBaHHH npn 4° rejib npoMH- 
B aJiH Ha CTeKJiHHHOM $HJitTpe 10 mji 0,25 M padBopa NaHC0 3 h HeMefljieHHO 
n 0 6aBJiHJiH cooTBeTCTByiom,HH jinraHfl. 

UMMo6uAU3aiiUH $oc&oaMuda (Ilia) na ce$apo3e (IV a). K 1 mji cBe>Ke- 
aKTHBupoBaHHOH 6poMi^HaHOM ce$apo3bi 4B b i^GHTpH<J)y)KHOH npoSnpKG 
A o6aBJiHJiH 1 mji 0,25 M pacTBopa NaHC0 3 , pH 8,3, coaeprcamero 103 mkmojib 
(Ilia), h ocTaBjifljin npn 4° Ha 20 h npn nepeMeniHBaHHH. CMecb ijeHTpn^y- 
rapoBajiH npn 2000 o6/mhh, cynepHaTaHT, coflepjKanjHH 83 MKMOJib (Ilia), 
jnio<|>HJiH30BajiH h Hcnojib30BajiH ajih noBTopHOH KOHAeHcauHH. K ocaAKy 
reJiH Ao6aBjifljin 1 mji 0,25 M 3TanojiaMHHa h nepeMeniHBajiH 2 h npn 20°. 
Tejib nepeHOCHJiH b KOJioHKy h npoMUBajin nocjieAOBaTejibHo 20 o6teMaMH 
0,1 M CH 3 COONa, 6 M MOHeBHHH, 0,1 M NaHC0 3 (bcg ajiioeHTbi coAepwajin 
0^5 M NaCl) h, HaKOHeni, 1 M NaCl. GreneHb KOBaJieHraoro npncoeflHHeHHfl 
(Joc^oaMHAa (Ilia), onpeflejieHHan no yO-norjiomeHHK> h KOjimecTBy o6mero 
$oc$opa, nepexoAHmero b pacTBop nocjie nmpojiH3a (0,6 h. HC1, 20°, 16 
aJiHKBOTbi (0,1 mji) aACopSema (IVa), cociaBHJia 4,8 mkmojib (Ilia) Ha 1 mji 
rejia. 

HMMo6u.iU3aitusi 0oc$oaMuda (1 1 1 6) na ce$apo3e (IV 6). 1 mji cBevneaKTH- 
BnpoBaHHOH 6poMunaHOM ce<ftapo3H 4B o6pa6aTbiBajra 73,0 mkmojib $oc<f>o- 
aMHAa (1 1 16) no BHineonncaHHOH MeTOflHKe. KojiH^ecTBo HMMo6HJiH30BaHHoro 
jinraHAa, onpeAejieHHoe no yd>-norjiom;eHHio (5,2 mkmojib) h o6meMy $oc$opy 
(10,3 MKMOJib) b rwApojiH3aTe aacopSeHTa, cocTaBHJio 5,2 MKMOJib (III6) 
na 1 mji ce(j)apo3bi. 

Abtoph raySoKo 6jiaroAapHH H. H. Cokojioboh 3a u;eHHbie o6cy>KfleHHH 
h A. C. IUauiKOBy 3a cteMKy cneKTpoB ITMP. 
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NEW AFFINITY ADSORBENTS DERIVED FROM URIDINE 
NUCLEOTIDE PHOSPHOAMIDATES 



SH1BAEV V. N., KUSOV Yu. Yu. , KALINCHUK N. 
KOCHETKOV N. K. 



A., 



N. D. Zelinsky Institute of Organic Chemistry, 
Academy of Sciences of the USSR, Moscow 

Biospecific adsorbents for uridine nucleotide-linked enzymes were prepared througl 
reaction of NMuridine-5'-phosphoryl)-or NMuridine-5'-pyrophosphorylH,6-diaminS 
hexaneswithBrCN-activated sepharose. The adsorbents contained immobilized uridinl 
5 -phosphate or-pyrophosphate residues linked to matrix through phosphoamide bond! 
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